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LEUKEM1A/LYMPH0MA CHIP #3: PROBES USED IN ANALYSIS 
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GREEN: Tumor probes where gene expression would be desired. 

RED: Normal essential tissue probes where gene expression is to be avoided. 

BLACK : Normal tissue probes where gene expression is acceptable. 
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Lyl447 sequence: 

CCTTCAACAGACTGGATGGGGTCCACCCACATTAGGGTGGAGTTTGTTCT 
GGTACCTTACTCTCCCAATGAGAAATTCTAAATTTCCTCTTTTGAGCCTGG 
CGTGAGCTTTGTTGGTGATGTGACCTCCAGAACAGACTGTGA^ 
CTGTCCTTCTTGACTTTCAACTAGGCCTAAAAAAGCGN 

Lyl448 Sequence 

CCAGGGTTAGGAGATTCAAATGTCTTTATC^ 

TTATAATGGATTATCTATACAATCTCC^CATTCCCACATTTTGCATTAGAGAATGG 
GTTTCCCTTAGAGTTCTCACCTGTTGTCTTATATCCATCTAGGAATCCCCATCT 
CCCGGGGGACAGGTGACTGAAGGACAAAAACTGATCCTGCT 
CTGGTACAGAGAGGCCACAGGAACCAGTATGGGAAAGAAAACCC^ 

TGAAAGAGAGTGATGCCGGCAAATATTAOTGTAGAGCTGAC^CGGCCATGTGCCTATCCAGAGCAAGGTGGTC 
CCTGTGAGAAGACCTGATGGCTATAGAAGAGACCTCATGACAG 

Lyl449 Sequence 
CCACATTGGGAATTCTGCAGACAGGTC 

GGCCGGACCTCTGCAGGGAGGTGAGGCCCCTGCCCCCATC^^ 

ctgggaggggagaagtgctgataaaaaggccaaaaatcacacaga^^ 

gtgctaccgtattcctacgagcacggggccix5tgttgaagactccct 

agcctcacagcctctttctcagatgcagtcacc^ 

CCTGTTTGTGGCCTGGCTGACAGATATTTTC 
GTCAGCACTCANGGATTCTGTCTTT 

Lyl451 Sequence 

CTGCTGOTAAATACCTNTGAGAAACTCTGCTTCTATCTAAGGGGACCT 
GAACCTCCTANACGGACCCITTGGCATAATGAATTGGACCAACTC 

T&AACAATCTCACCCAATTACTCTGCT CGAGGCAG CCCCTTAGGCGGCTTAGG 

Lyl452 Sequence 

AAAANACCAATAGCAGCCAAAACAGAACATTTGTAAACAAAACCA 
ATTCTTTTGAAACATTAAGTATATGCAATAAAGAGAATATAGACCATCTTTT^ 
AACAATGTCACGAATAATAGACAGAAATCGGTGTTAT 
TCATGGGGCTGCAGAGGCCACACTTTCCAGAAGTTTTCT 

GGAAGCACTGTTTGTGAAAGCAAGCCCTGCACGCTGAACATCTTCTACATGTTGCT 
GTATTCTGGCAAAATTCACAAATAAAGCCCTTTCC^ 

AATGTCCTTGAGTAAGGGTGCCAGCAGCCCTTTCTTGATCCTGACCAGGTCCT 

TCAAGTGTCCCGGCCCTGCCCTACAAACTCCTTTA^ 

GGAAGAGCTGCTCCTGAATTTCCTTC 

Lyl453 Sequence 

CCAAAAACTTCAGCATAAAAACTATCCTGTCTGTGTATTATATATATTTATTA 

TGAACCAGGTAATTTTATCTAGTGTGAAAAGAAATTAATCA^ 

TTAAAATATTTCATAATTGGAGGGAATCTTTGGAGAT^ 

TATTGACCAGTGGAAGTTGATGATTTGTC 

TTCAGTNCCTTTAC^GTTATTTGAAATTATTTCT^ 

AAAATGTGGAGAATGC^TTAATCATTATTTAATCAATGAGT^ 

Lyl454 Sequence 

CCTCTCCAC^GGCTGCITGAGTGTCCTCATG^ 

CAGAAGCTACCAAATCITTATGTTTGAAGTCATGCACC^ 

TTCAGTCTGGGAGGGAATTGCAC^GGGCATGAATACTCCACT^ 

AATGAGCAAATGAATGCACAGATAGAATATTAGCAGTGACAATGATGCTAG^ 

TTCTCCCCCAACCCTCrCCTGCTCCCAGGCAAGAAGCCCTCTA 

GGGAACCTATTCCGCCXSTGGGAC^GTGTTAATTAGTGGAAAA 

TTACCTGCTGATCACTGTCCAGACTTCTGGAGGAC^ 

CATCCCirCCCTCCAGTATATTAGAGTTACXSTAAATTCTTAAAAT 

Lyl45€ Sequence 

AAAGACATGAAAAATATCCCAAGATCATACTAGATCATAAT^ 

TCTAACAGTTTCCTTCATGTTGTTTTAATGCACAAGGGCAGA 

GTGCTATAGACACTTCATGGTGTCGTACCCAG^ 

TAGTAGCCCACCCCCCGCTGCTCCAAAACTTGAGTTGTT^ 

CAGAAAGCCTGGAGAGAGGCCAAGATGCCACCCACC 

TAAGTGCCGCTTTGCTTCTTCCTCTCTO 

AAAATGACTGATCCCACCGCCAGTGACTCA 

Lyl458 Sequence 

TTGAATGCTCTATTTTGCmTAANNNTTTATTNAC^ 

GGGTTANAATCAAANAAACATANAGTNGGGCAATATACTTNATCCTACCCATCCCACCCA^ 
ATTCTCATTAATTTTGGGAAAN»TNANAANATGGGTTCGTTG^ 
CCAACACCCCATGCCCAGGGGGGNCACCCTCCAATACAA 
ATCTGCCATCATNTTCCCATNTTCCAGNCTNTTTCCATTC 

Figure 9 
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Lyl464 Sequence 

aaagagaactaatggaagtggattgaatacagcagtctcaactgggggcaattttgcccc 
ccagaggacattgggcaatgtttggagacattttggtcattatacttggggggttggggg 
atggtgggatgtgtg 

Iiyl480 Sequence 
CTGTAGCCTCTGC^GTGAAAATCCAG^^ 

ACAGGCAGGGCTGGCTGATAGGGAGTATGGGAGAAGGACACGCTCAGATGAAAA.CATG CATG CAACG ATTTTCACCACTG 

AACAC&CTGTTTTCTGTGATAGAAACTGTCGGCCCTGCTGGC^ 

CACCAGGGCGGCCTGCCCCTGATGCTCCTTTGTTACCTGCT 

AGTGAGCCCAGACCCCAGGGAAAAGCTTGGGTAGAACAATCCAAGGGGC^GCC 

CTGCTCTAGAAGCCTGGAGG 

Lyl482 Sequence 

AAAAACAATCTACAGGCAGTTCTTTACAAGTCTCATATTT^ 

CTAAATATACGATTCTTTGGCATATTGGAATTGGTGAGCCTCAAAGAC CGCACGAGACAGT C CC 

GCTTATTCCTCTGCACGX3&CTCGGAGACGGTCCT 

GAGTCCCCAGAAGCATGGACAGTTCTGCTCTGTTTCCATCGCTCAGGCAGGGGAGAGAGTCCGTGG 



Lyl483 Sequence 

GCTGGAGCTTCACTGTGAGTCCCTGAGAGGCTCCTTCCCGATCCTGTACTGGTTTTATCACGAGGATGACACCTTGGGG^ 
ACATCTCGGCCCAOTCTGGAGGAGGGGCATCCTTCAACCTCTCTCTGACT 

GC t TGACAATGGCCTGGGGGCC CAGCACAGTAAAGTGGTGACACTCAATGTTACAGGAACTT CCAGGAACAGAACAGGC C 

TTACCRCTGCGGGAATCACGGGGCTGGTGSTCARCATCYTCGTCCTTGYVJGCTGCTGCTGCTCTGCTGCATTACGCCAGG 

GCCCGAAGGAAACCAGGAGGACTTTCTGCCACTGGAACATCTAGTCACAGTCCTAGTG 

CAGG 

Lyl484 Sequence 

CTGGGCATCACC^GTCACAGTTTCCAGCGTGCTGCTC^GCCCTCCGAGTGTGTGTGCTCATCCTTTTCATAGAAGTCCC 

ATCGGCCATGGAGAGGGTTGGGCTGCAGAGCTGTGATTGCCAGAGGCCCTTCCTTGAGAACTGTGGGGAAGGAGGCCCTG 

GGGGTTTCTTCTGTAGGCAGAGCTCAGGCCCCAGTCACCTCTGCCA 

CTGCCTCTCTCTTCCTACCCATCTGCAGACC^GCAGAATAT^ 

TCTGGACACGGCCAGAGCAgGTGACTGCGGGGCTGGTTTTGOTGGGCTTCCCT 

CGACTCAAGGCCAGGTTCTTCTCCCACTTGTGGCCTCTTGGGCTTGGAGGCTGAGCCAGGGGCTCCTCTCCTGCTGGCCG 

TCCAGGAAGASgACATCTTCAC^TCCTCAGTCTTC 

CTTGTGTCCCATGCTGGGCCCTCCATCAGGCAGCAACAGG 

Lyl485 

CTGTCTCCACTGTGGAGTTACTATTTTT^ 

CAGGACAGGGGAATTAAGTGCCACCTTCTGGAGAGGGAGCAT^ 

TGTTTCTCCTCTAAAAAACTCTTTAATCCTTTTAAGCCTCA^ 

CCAAGGGAGTAGTTAGAAGGTAACATGATAGGTGGAAAGCACITAACATAGGC 

AGACCGATCCAACTATCTGAAGGAGTCACTTAACTCTACTGTACTGC^ 

GTAAAGGCCCCCAGTCCCTGAGACGGGCCAGTTTGGAaGACAGGCTG^^ 

TGAGAAGGGAGGTCTGGAGACAGAATGCCAAAAGCCCATTAAAOT 

AGAAGAAC CAGAGGAGCTCAG 

Lyl488 

CTGGYCTGTGGGTGATCCCAGCTCTTACTAGGAGAGGGAASTGAaGGTCYTGGTGCCAGGGGC 
AAACAGTCCAGGATGGAACCTGGTTCACCCTTCATACCAGCT 

CCTTGAGAACTTGGCCCTGTGCTTTAGACCCMGGACCCGATTCCTGGGCTAGGAAAGAGAGAACAAGCAAGCCGG 
ACCTGCCCCCAGGTGG 
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Figure la. Lyl464 full length DNA sequence I£>l>iTM O x S°CI 

1 gatgcaagga gatgagacag ttaaatttac ttcctctttt ctaatctgag aggtttcatg 
61 ttgaagaaaa tcagtgttgg ggttgcagga gacctaaaca cagtcaccat gaagctgggc 
121 tgtgtcctca tggcctgggc cctctacctt tcccttggtg tgctctgggt ggcccagatg 
181 ctactggctg ccagttttga gacgctgcag tgtgagggac ctgtctgcac tgaggagagc 
241 agctgccaca cggaggatga cttgactgat gcaagggaag ctggcttcca ggtcaaggcc 
301 tacactttca gtgaaccctt ccacctgatt gtgtcctatg actggctgat cctccaaggt 
361 ccagccaagc cagtttttga aggggacctg ctggttctgc gctgccaggc ctggcaagac 
421 tggccactga ctcaggtgac cttctaccga gatggctcag ctctgggtcc ccccgggcct 
481 aacagggaat tctccatcac cgtggtacaa aaggcagaca gcgggcacta ccactgcagt 
541 ggcatcttcc agagccctgg tcctgggatc ccagaaacag catctgttgt ggctatcaca 
601 gtccaagaac tgtttccagc gccaattctc agagctgtac cctcagctga accccaagca 
661 ggaagcccca tgaccctgag ttgtcagaca aagttgcccc tgcagaggtc agctgcccgc 
721 ctcctcttct ccttctacaa ggatggaagg atagtgcaaa gcagggggct ctcctcagaa 
781 ttccagatcc ccacagcttc agaagatcac tccgggtcat actggtgtga ggcagccact 
841 gaggacaacc aagtttggaa acagagcccc cagctagaga tcagagtgca gggtgcttcc 
901 agctctgctg cacctcccac attgaatcca gctcctcaga aatcagctgc tccaggaact 
961 gctcctgagg aggcccctgg gcctctgcct ccgccgccaa ccccatcttc tgaggatcca 
1021 ggcttttctt ctcctctggg gatgccagat cctcatctgt atcaccagat gggccttctt 
1081 ctcaaacaca tgcaggatgt gagagtcctc ctcggtcacc tgctcatgga gttgagggaa 
1141 ttatctggcc accggaagcc tgggaccaca aaggctactg ctgaatagaa gtaaacagtt 
1201 catccatgat ctcacttaac caccccaata aatctgattc tttattttct cttcctgtcc 
1261 tgcacatatg cataagtact tttacaagtt gtcccagtgt tttgttagaa taatgtagtt 
1321 aggtgagtgt aaataaattt atataaagtg agaattagag tttagctata attgtgtatt 
1381 ctctcttaac acaacagaat tctgctgtct agatcaggaa tttctatctg ttatatcgac 
1441 cagaatgttg tgatttaaag agaactaatg gaagtggatt gaatacagca gtctcaactg 
1501 ggggcaattt tgccccccag aggacattgg gcaatgtttg gagacatttt ggtcattata 
1561 cttggggggt tgggggatgg tgggatgtgt gtgctactgg catccagtaa atagaagcca 
1621 ggggtgccgc taaacatcct ataatgcaca gggcagtacc ccacaacgaa aaataatctg 
1681 gcccaaaatg tcagttgtac tgagtttgag aaaccccagc ctaatgaaac cctaggtgtt 
1741 gggctctgga atgggacttt gtcccttcta attattatct ctttccagcc tcattcagct 
1801 attcttactg acataccagt ctttagctgg tgctatggtc tgttctttag ttctagtttg 
1861 tatcccctca aaagccatta tgttgaaatc ctaatcccca aggtgatggc attaagaagt 
1921 gggcctttgg gaagtgatta gatcaggagt gcagagccct catgattagg attagtgccc 
1981 ttatttaaaa aggccccaga gagctaactc acccttccac catatgagga cgtggcaaga 
2041 agatgacatg tatgagaacc aaaaaacagc tgtcgccaaa caccgactct gtcgttgcct 
2101 tgatcttgaa cttccagcct ccagaactat gagaaataaa attctgttgt ttgt 



Figure lb.Lyl464 protein sequence ( V<ft>\ W * ® {Ftffj 



MKLGCVLMAWALYLS LGVLWVAQMLLAAS FETLQCEGPVCTEES SCHT EDDLTDAREAGFQVKAYTF SE P FHL I VS YDWLI LQGPAKP V 
FEGDLLVLRCQAWQDWPLTQVTFYRDGS ALGPPG PNREFS I TWQKADSGHYHCSGIFQS PG PG I PETAS WAITVQELFPAP ILRAVP 
SAEPQAGSPMTLSCQTKLPLQRSAARLLFS FYKDGRI VQSRGLS SEFQI PTASEDHSGS YWCEAATEDNQVWKQSPQLEIRVQGAS SS A 
APPTI^PAPQKSAAPGTAPEEAPGPLPPPPTPSSEDPGFSSPLGMPDPHLYHQMGLL^ 
TAE 
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TMpred Report for Ly1464 

Date: 

MKLGCVL^SAWALYLSLGVLWVAQMLLAASFETLQCEGPVCTEESSCHTED 
DLTDAREAGFQVKAYTFSEPFHLIVSYDWklLQGPAKPVFEGDLLVLRCQ 
AWQDWPLTQVTFYRDGSALGPPGPNREFSITWQKADSGHYHCSGIFQSP 
GPGiPETASWAITVQEXiFPAPILRAVPSAEPQAGSPMTLSCQTKLPLQR 
SAARLLFSFYKDGRIVQSRGLSSEFQIPTASEDHSGSYWCEAATEDNQVW 
KQSPQLEIRVQGASSSAAPPTLNPAPQKSAAPGTAPEEAPGPLPPPPTPS 
SEDPGFSSPLGMPDPHLYHQMGLLLKHMQDVRVLLGHLLMELRELSGHRK 
PGTTKATAE 

Black - intracellular/ Red = Transmembrane, Blue = Extracellular 

Lyl464 has 358 amino acids and 3 Transmembrane Domains 

Transmembrane Domain 1: 3 - 24 Score: 1.5484 
Transmembrane Domain 2 : 71 - 92 Score: 1.3009 
Transmembrane Domain 3: 158 - 179 Score: 1.3027 
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Ly 1464 MHC class binding peptides(S€dXD ND*' IDtffi 

Lyl464 Al binding peptides 



RANKING 


SEQUENCE 


STARTING POSITION 


THEORETICAL SCORE 


1 


ATEDNQVWK 


243 


90 


2 


ASEDHSGSY 


230 


67.5 


3 


FSEPFHLIV 


67 


67 . 5 


4 


LTDAREAGF 


52 


25 


5 


SSEDPGFSS 


300 


6.75 


6 


ELFPAPILR 


167 


5 


7 


SSEFQIPTA 


222 


2.7 


8 


MPDPHLYHQ 


312 


2.5 


9 


LMELRELSG 


339 


2.25 


10 


HTEDDLTDA 


47 


2.25 


11 


CTEESSCHT 


40 


2.25 


12 


ALGPPGPNR 


118 


2 


13 


LGMPDPHLY 


310 


1.25 


14 


CVLMAWALY 


5 


1 


15 


KADSGHYHC 


135 


1 


16 


LRELSGHRK 


342 


0.9 


17 


QLEIRVQGA 


255 


0.9 


18 


WCEAATEDN 


239 


0.9 


19 


SAEPQAGSP 


179 


0.9 


20 


AREAGFQVK 


55 


0.9 



Lyl464 HLA A2 binding peptides 

RANKING SEQUENCE STARTING POSITION THEORETICAL SCORE 



1 


YLSLGVLWV 


13 


4047 


2 


LLLKHMQDV 


323 


1006 


3 


VLMAWALYL 


6 


739 


4 


VLWVAQMLL 


18 


301.4 


5 


MLLAASFET 


24 


271.9 


6 


ILQGPAKPV 


81 


118.2 


7 


LLGHLLMEL 


334 


83.53 


8 


LLAASFETL 


25 


33.81 


9 


WQDWPLTQV 


102 


29.84 


10 


YTFSEPFHL 


65 


29.29 


11 


LIVSYDWLI 


73 


18.29 


12 


KLPLQRSAA 


195 


17.39 


13 


WALYLSLGV 


10 


16.44 


14 


CQAWQDWPL 


99 


16.24 


15 


GIPETASW 


153 ' 


16.08 


16 


QVWKQSPQL 


248 


15.51 


17 


ALYLSLGVL 


11 


8.38 


18 


QMGLLLKHM 


320 


8.252 


19 


WAITVQEL 


160 


7.309 


20 


GVLWVAQML 


17 


6.916 



Lyl464 A3 binding peptides 



RANKING 


SEQUENCE 


STARTING POSITION 


THEORETICAL SCORE 


1 


WLILQGPAK 


79 


30 


2 


ELFPAPILR 


167 


18 


3 


ALGPPGPNR 


118 


9 


4 


VLWVAQMLL 


18 


6 


5 


VLMAWALYL 


6 


5.4 


6 


LLKHMQDVR 


324 


4 


7 


KLGCVLMAW 


2 


2.7 


8 


PMTLSCQTK 


187 


2 


9 


LLGHLLMEL 


334 


1.8 


10 


GMPDPHLYH 


311 


1.8 


11 


PLTQVTFYR 


106 


1.8 


12 


LLAASFETL 


25 


1.8 


13 


HLYHQMGLL 


316 


1.35 
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14 


ALYLSLGVL 


11 


1.35 


15 


YLSLGVLWV 


13 


1.2 


16 


ATEDNQVWK 


243 


1 


17 


HMQDVRVLL 


327 


0.9 


18 


HLIVSYDWL 


72 


0.9 


19 


KAYTFSEPF 


O J 


0 . 9 


20 


KQSPQLEIR 


251 


0 .81 


Lyl464 All binding peptides 






RANKING 


SEQUENCE 


^TARTTNfi POSITION 


THEORETICAL SCORE 


1 


ATEDNQVWK 


243 


1 


2 


WLILQGPAK 


79 


0.6 


3 


KQSPQLEIR 


251 


0 . 36 


4 


CQTKLPLQR 


192 


0 .24 


5 


RVLLGHLLM 


332 


0 . 18 


6 


PTLNPAPQK 


270 


0 .15 


7 


ELFPAPILR 


167 


0.096 


8 


ARLLFSFYK 


203 


0 . 09 


9 


GVLWVAQML 


17 


0.09 


10 


LLKHMQDVR 


324 


0.08 


11 


ALGPPGPNR 


118 


0.08 


12 


GHRKPGTTK 


347 


0.06 


13 


RVQGASSSA 


259 


0.06 


14 


LPLQRSAAR 


196 


0.06 


15 


EFSITWQK 


XZ i 


0 . 06 


16 


YTFSEPFHL 


Dj 




17 


ITVQELFPA 


1 CI 




18 


YHQMGLLLK 


jJLO 


0 . 04 


19 


QVWKQSPQL 


O A O 


U . U*i 


20 


LFSFYKDGR 


one 




Lyl464 A24 binding peptides 






RANKING 


SEQUENCE 






1 


LYHQMGLLL 


31/ 


zuu 


2 


VFEGDLLVL 


fi Q 
O J 


^o 


3 


TFYRDGSAL 


111 


zu 


4 


GFQVKAYTF 


C Q 

33 


15 


5 


KPVFEGDLL 




XZ 


6 


HMQDVRVLL 


Q *5 *7 

Jz / 


i n nfi 

1U » uo 


7 


GVLWVAQML 


1 *7 


10 Oft 


8 


LYLSLGVLW 


12 


Q 


9 


IFQSPGPGI 


146 


7.5 


10 


APEEAPGPL 


285 


7.2 


11 


TFSEPFHLI 


66 


7.2 


12 


HLLMELREL 


337 


6.6 


13 


MTLSCQTKL 


188 


6.6 


14 


VLMAWALYL 


6 


6 


15 


HLIVSYDWL 


72 


b 


16 


GCVLMAWAL 


A 
*± 


c 
o 


17 


LLGHLLMEL 


334 


c Oft 


18 


WAITVQEL 


160 


C O Q 


19 


MAWALYLSL 


Q 

o 


4 Q 
. O 


20 


DVRVLLGHL 




4 ft 


Lyl464 A68 binding peptides 






RANKING 


SEQUENCE 


o 1*VKI Xl\l\j xrVJoXAXVJJN 


THPOPFTTPATi 3 COR? 


1 


ELFPAPILR 


167 


45 


2 


ATEDNQVWK 


243 


30 


3 


ELRELSGHR 


341 


22.5 


4 


DVRVLLGHL 


330 


18 


5 


ALGPPGPNR 


118 


15 


6 


LPLQRSAAR 


196 


10 


7 


ETLQCEGPV 


31 


9 


8 


RVLLGHLLM 


332 


8 


9 


WAITVQEL 


160 


8 


10 


WVAQMLLAA 


20 


8 
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11 


GVLWVAQML 


17 


8 


12 


KQSPQLEIR 


251 


7.5 


13 


PTLNPAPQK 


270 


6 


14 


AVPSAEPQA 


176 


6 


15 


ETAS WAIT 


156 


6 


16 


WQKADSGH 


132 


6 


17 


WLILQGPAK 


79 


6 


18 


LLKHMQDVR 


324 


5 


19 


CQTKLPLQR 


192 


5 


20 


RVQGASSSA 


259 


4 



Lyl464 B7 binding peptides 

RANKING SEQUENCE STARTING POSITION THEORETICAL SCORE 



1 


DVRVLLGHL 


330 


200 


2 


KPVFEGDLL 


87 


80 


3 


APEEAPGPL 


285 


72 


4 


LQRSAARLL 


198 


40 


5 


QVWKQSPQL 


248 


20 


6 


WAITVQEL 


160 


20 


7 


IVSYDWLIL 


74 


20 


8 


GVLWVAQML 


17 


20 


9 


VLMAWALYL 


6 


12 


10 


MAWALYLSL 


8 


12 


11. 


ALYLSLGVL 


11 


12 


12 


HMQDVRVLL 


327 


6 


13 


APGTAPEEA 


281 


6 


14 


SPMTLSCQT 


186 


6 


15 


DAREAGFQV 


54 


6 


16 


RVLLGHLLM 


332 


5 


17 


HLLMELREL 


337 


4 


18 


LLGHLLMEL 


334 


4 


19 


VLWVAQMLL 


18 


4 


20 


HLYHQMGLL 


316 


4 



Lyl464 B8 binding peptides 

RANKING SEQUENCE STARTING POSITION THEORETICAL SCORE 



1 


DVRVLLGHL 


330 


8 


2 


DAREAGFQV 


54 


7.2 


3 


LQRSAARLL 


198 


1.2 


4 


FSFYKDGRI 


207 


1 


5 


GCVLMAWAL 


4 


0.8 


6 


MAWALYLSL 


8 


0.8 


7 


LLGHLLMEL 


334 


0.4 


8 


HLYHQMGLL 


316 


0.4 


9 


VLMAWALYL 


6 


0.4 


10 


AARLLFSFY 


202 


0.4 


11 


APILRAVPS 


171 


0.4 


12 


ALYLSLGVL 


11 


0.4 


13 


HLIVSYDWL 


72 


0.4 


14 


LLAASFETL 


25 


0.4 


15 


VLWVAQMLL 


18 


0.4 


16 


HMQDVRVLL 


327 


0.3 


17 


APEEAPGPL 


285 


0.24 


18 


LSGHRKPGT 


345 


0.2 


19 


GVLWVAQML 


17 


0.2 


20 


SSSAAPPTL 


264 


0.2 



Lyl464 B27 binding peptides 



RANKING SEQUENCE STARTING POSITION THEORETICAL SCORE 

1 LRELSGHRK 342 2000 

2 VRVLLGHLL 331 2000 

3 ARLLFSFYK 203 2000 

4 AREAGFQVK 55 2000 

5 QRSAARLLF 199 1000 
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6 


GRIVQSRGL 


213 


600 


7 


NREFSITW 


125 


600 


8 


KQSPQLEIR 


251 


300 


9 


HRKPGTTKA 


348 


200 


10 


IRVQGASSS 


258 


200 


11 


CQAWQDWPL 


99 


200 


12 


LRCQAWQDW 


97 


200 


13 


HLYHQMGLL 


316 


150 


14 


ALYLSLGVL 


11 


150 


15 


VLWVAQMLL 


18 


150 


16 


CQTKLPLQR 


192 


100 


17 


VQKADSGHY 


133 


100 


18 


LQGPAKPVF 


82 


100 


19 


AQMLLAASF 


22 


100 


20 


KAYTFSEPF 


63 


75 



Lyl464 B35 binding peptides 



RANKING 


SEQUENCE 


STARTING POSITION 


THEORETICAL SCORE 


1 


KPVFEGDLL 


87 


60 


2 


WPLTQVTFY 


105 


40 


3 


EPFHLIVSY 


69 


40 


4 


TPSSEDPGF 


298 


30 


5 


GPPGPNREF 


120 


20 


6 


AARLLFSFY 


202 


18 


7 


QSRGLSSEF 


217 


15 


6 


APEEAPGPL 


285 


12 


9 


VQKADSGHY 


133 


9 


10 


ASEDHSGSY 


230 


6 


11 


KAYTFSEPF 


63 


6 


12 


EAGFQVKAY 


57 


6 


13 


SSSAAPPTL 


264 


5 


14 


LSCQTKLPL 


190 


5 


15 


SSCHTEDDL 


44 


5 


16 


AATEDNQW 


242 


4.5 


17 


KPGTTKATA 


350 


4 


18 


RVLLGHLLM 


332 


4 


19 


FPAPILRAV 


169 


4 


, 20 


DAREAGFQV 


54 


3.6 



Lyl464 B44 binding peptides 



RANKING 


SEQUENCE 


STARTING POSITION 


THEORETICAL SCORE 


1 


LEIRVQGAS 


256 


30 


2 


WPLTQVTFY 


105 


27 


3 


SEDHSGSYW 


231 


24 


4 


REAGFQVKA 


56 


18 


5 


EAGFQVKAY 


57 


13.5 


6 


LGMPDPHLY 


310 


12 


7 


SEFQIPTAS 


223 


12 


8 


AEPQAGSPM 


180 


12 


9 


QELFPAPIL 


166 


12 


10 


SEPFHLIVS 


68 


12 


11 


SAARLLFSF 


201 


9 


12 


VAITVQELF 


161 


7.5 


13 


ASEDHSGSY 


230 


6 


14 


AARLLFSFY 


202 


6 


15 


DDLTDAREA 


50 


4.5 


16 


FSITWQKA 


128 


3.375 


17 


EPFHLIVSY 


69 


3 


18 


AQMLLAASF 


22 


3 


19 


TEDNQVWKQ 


244 


1.8 


20 


CEGPVCTEE 


35 


1.8 
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ANALYSIS RESULTS OF THE PROGRAM TSITES. 

These are the results of the analysis of the file --> LY1 464-1 TXT 
Beginning with residue: 1 and ending with residue: 359 
AMPHI Window size: 11 

A - AMPHI mid points of blocks. 

R - Residues matching the Rothbard/Taylor motif. 

D - Residues matching the lAd motif. 

d - Residues matching the I Ed motif. 



5 10 15 20 25 30 35 40 45 50 55 60 65 70 75 
MKLGCVLMAWALYLSLGVLWVAQMLLAASFETLQCEGPVCTEESSCHTEDDLTDAREA 
GFQVKAYTFSEPFHLIV 

AAAAAAA... 

RRRR RRRR RRRR RRRR 

.DDDDDD 



80 85 90 95 100 105 110 115 120 125 130 135 140 145 150 
SYDWLILQGPAKPVFEGDLLVLRCQAWQDWPLTQVTFYRDGSALGPPGPNREFSITVVQ 
KADSGHYHCSGIFQSP 

AA AAAAAAAA AAAAAAAA 

R RRRR.. 



155 160 165 170 175 180 185 190 195 200 205 210 215 220 225 
GPGIPETASVVAITVQELFPAPILRAVPSAEPQAGSPMTLSCQTKLPLQRSAARLLFSFYK 
DGRIVQSRGLSSEF 

AAAAAAAAAA AAAAAAA 

RRRR RRRRR RRRRR RRRR 

DDDDDD DDDDDD DDDDDD DDDDDD.. 
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230 235 240 245 250 255 260 265 270 275 280 285 290 295 300 
QIPTASEDHSGSYWCEAATEDNQVWKQSPQLEIRVQGASSSAAPPTLNPAPQKSAAPG 
TAPEEAPGPLPPPPTPS 

AAAAA 

RRRRR RRRRR 

DDDDDDDDD 



305 310 315 320 325 330 335 340 345 350 355 360 365 370 375 
SEDPGFSSPLGMPDPHLYHQMGLLLKHMQDVRVLLGHLLMELRELSGHRKPGTTKATA 
E 

AAA AAAAAAAA AAAAAAAAAAAAAA 

RRRR..RRRRR RRRRRRRRRR RRRR.... 

DDDDDD 
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Table 4. Immunogenic portions of Lyl464 ($E:Q ID ND^- \ D^SZlo - ( 0^ 

MKLGCVLMAWALYLSLGVLWAQMLLAASF 

LGVLWVAQMLLAAS FETLQCEGP VCTEES S 

ETLQCEGPVCTEES S CHTEDDLTDAREAGF 

CHTEDDLTDAREAGFQVKAYTFSEPFHLIV 

QVKAYTFSEPFHLIVSYDWLILQGPAKPVF 

SYDWLILQGPAKPVFEGDLLVLRCQAWQDW 

EGDLLVLRCQAWQDWPLTQVTFYRDGSALG 

PLTQVTFYRDGSALGPPGPNREFSITWQK 

PPGPNREFSITWQKADSGHYHCSGIFQSP 
ADSGHYHCSGIFQSPGPGIPETASWAITV 

GPGIPETASWAITVQELFPAPILRAVPSA 

QELFPAPILRAVPSAEPQAGSPMTLSCQTK 

EPQAGSPMTLSCQTKLPLQRSAARLLFSFY 

LPLQRSAARLLFSFYKDGRIVQSRGLSSEF 

KDGRIVQSRGLSSEFQIPTASEDHSGSYWC 

QIPTASEDHSGSYWCEAATEDNQVWKQSPQ 

EAATEDNQVWKQSPQLEIRVQGASSSAAPP 

LEIRVQGASSSAAPPTLNPAPQKSAAPGTA 

TLNP APQKS AAPGTAPEEAPGPLP P P PTP S 

PEEAPGPLPPPPTPSSEDPGFSSPLGMPDP 

SEDPGFSSPLGMPDPHLYHQMGLLLKHMQD 



VRVLLGHLLMELREL SGHRKPGTTKATAE 



HLYHQMGLLLKHMQDVRVLLGHLLMELREL 



Ifl 



ru 
:; 

Si 
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Ra12-LY1464 profile 

LY1464 and recombinant Ra12-LY1464 

Researcher: Alex Gaiger, Aijun Wang, Jonathan Clapper 
Reference: cloning bk813 pgl50 7 151, 154-157; expression bk966 pg2-3 

Description: LY1464 is an antigen discovered by Lymphoma program. The full-length sequence was 
cloned through PCR amplification of normal tissue cDNA known to express the gene. The 
gene was cloned directly into the pCRX2 vector as an N-terminal Ral2 fusion, Ral2- 
LY1464, using restriction enzymes Nco I and Xho I, and the sequence was subsequently 
confirmed. The optimal protein expression conditions were determined by mini-induction 
screening. 



CORIXA CORPORATION 
Antigen Discovery 



Protein Diagram of Ral2-LY1464 



N-term 6x His tag 



Cloning Strategy: 

Tumor and normal tissue derived cDNAs (from Alex Gaiger) were each used separately as 
templates in PCR amplification reactions with two primers (LY1464-sense and Lyl464- 
antisense) flanking the 5' and 3' ends of the gene LY1464. As shown below, the primers 
contained short 5' restriction enzyme recognition sequences. The PCR products were 
separted on agarose gel and the DNA band of ~1080bp was gel purified. This 'insert' was 
^ digested using restriction enzymes Nco I and Xho I and ligated to pCRX2, which was 

linearized with the same two restriction enzymes and then dephosphorylated. The vector 
and insert were combined in equi-molar ratios with DNA ligase to create a completed 
plasmid construct pCRX2 Ral2-LY1464. This ligation mixture was used to transform 
competent E. coli Novablue cells. Ten individual colonies from both tumor cDNA derived 
and normal cDNA derived construct were picked for DNA miniprep. Miniprep samples 
were screened for insert through restriction enzyme digestion (Nco I, Xho I) and five 
positive samples were sent for DNA sequencing. Miniprep clone #4 (Corixa seq 
ID#90522) was confirmed and used for expression of recombinant Ral2-LY1464 in E. 
coli. 



Cloning Primers: 

m LY1464-sense , ^ 

q 5' - CGTCCATGGACatgaagctgggctgtgtcctc - 3' (pE^lD NO*< lO^M3J 

St prime 21bp 57%GC 56C Tm 

,w full length 32bp 59%GC 68C Tm 



LY1464-antisense . ^ 

5 1 - CCTTCTCGAGctattcagcagtagcctttgtggtc - 3' /S^IDI^O : I 0,V44^ 
prime 25bp 48%GC 58C Tm 
full length 35bp 51%GC 67C Tm 

Protein Expression: 

Various E. coli strain/culture conditions were screened for optimal expression conditions for 
recombinant protein expression- Briefly, the expression construct was used to transform different 
expression hosts, and then mini-induction cultures were screened at varied culture temperature, 
culture media and/or IPTG concentration. The optimal expression condition was determined by 
evaluating the results of SDS-PAGE and western blot. 

For Ral2-LY1464, The most optimal expression condition is pCRX2 Ral2-LY1464 in Tuner (DE3) 
CodonPlus-RP grown in 2xYS media at 37°C induced with l.OmM IPTG at 25°C (room temp) for 3hr. 
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Ra12-LY1464 profile CORIXA CORPORATION 

Antigen Discovery 

DNA/Protein Sequence: 

LY1464(DNA) 1080bp (SZCXXOtlb iO^S) 

atgaagctgggctgtgtcctcatggcctgggccctctacctttcccttggtgtgctctgggtggcccagat 
gctactggctgccagttttgagacgctgcagtgtgagggacctgtctgcactgaggagagcagctgccaca 
cggaggatgacttgactgatgcaagggaagctggcttccaggtcaaggcctacactttcagtgaacccttc 
cacctgattgtgtcctatgactggctgatcct ccaagcjt ccagccaagccagtttt t gaaggggacc tgc t 
ggttctgcgctgccaggcctggcaagactggccactgactcaggtgaccttctaccgagatggctcagctc 
tgggtccccccgggcctaacagggaattctccatcaccgtggtacaaaaggcagacagcgggcactaccac 
tgcagtggcatcttccagagccctggtcctgggatcccagaaacagcatctgttgtggctatcacagtcca 
agaactgtttccagcgccaattctcagagctgtaccctcagctgaaccccaagcaggaagccccatgaccc 
tgagttgtcagacaaagttgcccctgcagaggtcagctgcccgcctcctcttctccttctacaaggatgga 
aggatagtgcaaagcagggggct ctcct cagaattccagatccccacagctt cagaagatcactccgggt c 
atactggtgtgaggcagccactgaggacaaccaagtttggaaacagagcccccagctagagatcagagtgc 
agggtgcttccagctctgctgcacctcccacattgaatccagctcctcagaaatcagctgctccaggaact 
gctcctgaggaggcccctgggcctctgcctccgccgccaaccccatcttctgaggatccaggcttttcttc 
tcctctggggatgccagatcctcatctgtatcaccagatgggccttcttctcaaacacatgcaggatgtga 
gagtcctcctcggtcacctgctcatggagttgagggaattatctggccaccggaagcctgggaccacaaag 
gctactgctgaatag 

Ral2-LY1464 (DNA) 1500bp (5£Q 3£> t$b\ 10 ^IrA 

atgcatcaccatcaccatcacacggccgcgtccgataacx tccagctgtcccagggtgggcagggattcgc 
cattccgatcgggcaggcgatggcgatcgcgggccagatcaagcttcccaccgttcatatcgggcctaccg 
ccttcctcggcttgggtgttgtcgacaacaacggcaacggcgcacgagtccaacgcgtggtcgggagcgct 
^cggcggcaagtctcggcatctccaccggcgacgtgat caccgcggtcgacggcgc t ccga t caact cggc 
paccgcgatggcggacgcgcttaacgggcatcatcccggtgacgtcatctcggtgacctggcaaaccaagt 
pfgggcggcacgcgtacagggaacgtgacattggccgagggacccccggccgaattctccatggacatgaag 
ptgggctgtgtcctcatggcctgggccctctacctttcccttggtgtgctctgggtggcccagatgctact 
Jfgctgccagttttgagacgctgcagtgtgagggacctgtctgcactgaggagagcagctgccacacggagg 
"^itgacttgactgatgcaagggaagctggcttccaggtcaaggcctacactttcagtgaacccttccacctg 
ffittgtgtcctatgactggctgatcctccaaggtccagccaagccagtttttgaaggggacctgctggttct 
^cgctgccaggcctggcaagactggccactgactcaggtgaccttctaccgagatggctcagctctgggtc 
mccccgggcctaacagggaattctccatcaccgtggtacaaaaggcagacagcgggcactaccactgcagt 
^ggcatcttccagagccctggtcctgggatcccagaaacagcatctgttgtggctatcacagtccaagaact 
tgtttccagcgccaattctcagagctgtaccctcagctgaaccccaagcaggaagccccatgaccctgagtt 
^tcagacaaagttgcccctgcagaggtcagctgcccgcctcctcttctccttctacaaggatggaaggata 
l^tgcaaagcagggggctctcctcagaattccagatccccacagcttcagaagatcactccgggtcatactg 
fptgtgaggcagccactgaggacaaccaagtttggaaacagagcccccagctagagatcagagtgcagggtg 
[if tt ccagctctgctgcacctcccacattgaatccagctcctcagaaat cage tgc tccaggaact get cct 
^aggaggcccctgggcctctgcctccgccgccaaccccatcttctgaggatccaggcttttcttctcctct 
^gggatgccagatcctcatctgtatcaccagatgggccttcttctcaaacacatgcaggatgtgagagtcc 
^icctcggtcacctgctcatggagttgagggaattatctggccaccggaagcctgggaccacaaaggctact 
gctgaatag 

Ral2-LY1464 (protein) C^ Sd *** v0 ,^V) 

MHHHHHHTAASDNFQLSQGGQGFAI PIGQAMAIAGQIKLPTVHIGPTAFLGLGVVDNNGNGARVQRVVGSA 
PAASLGI STGDVITAVDGAPINSATAMADALNGHHPGDVISVTWQTKSGGTRTGNVTLAEGPPAEFSMDMK 
LGCVLMAWALYLSLGVLWVAQMLLAASFETLQCEGPVCTEESSCHTEDDLTDAREAGFQVKAYTFSEPFHL 
I VS YD WL I LQGPAKPVFEGD LLVLRCQAWQDWP LTQVTFYRDGSALGP PGPNREFS IT VVQKADSGHYH CS 
GIFQSPGPGIPETASVVAITVQELFPAPILRAVPSAEPQAGSPMTLSCQTKLPLQRSAARLLFSFYKDGRI 
VQSRGLSSEFQIPTASEDHSGSYWCEAATEDNQVWKQSPQLEIRVQGASSSAAPPTLNPAPQKSAAPGTAP 
EEAPGPLPPPPTPSSEDPGFSSPLGMPDPHLYHQMGLLLKHMQDVRVLLGHLLMELRELSGHRKPGTTKAT 
AE . 



Protein Info: Ral2.LY1464 

Molecular Weight 53010.15 Daltons 
499 Amino Acids 

29 Strongly Basic(+) Amino Acids (K,R) 
46 Strongly Acidic(-) Amino Acids (D,E) 
174 Hydrophobic Amino Acids (A,I,L,F,W,V) 
128 Polar Amino Acids (N,C,Q,S,T,Y) 
5.663 Isolectric Point 
-14.119 Charge at PH 7.0 
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Ly hPfl DNA Sequence 6£ Q ID ND". \ 0 ) UJ 



1 agcgagactt ccagtccgag gtcctgcttt ctgctatgga actattccac atgacaagtg 
61 gaggtgatgc agcgatgttc agagacggca aagagcctca gccaagtgca gaagctgctg 
121 ctgccccttc tcttgccaac atctcctgct tcacccagaa gctggtggag aagctgtaca 
181 gtgggatgtt ctcggcagac cccaggcata tcctcctctt catcctggag cacatcatgg 
241 tggtcattga gactgcctct tctcaaaggg acactgtcct cagcacttta tacagcagtt 
301 taaataaagt cattctttat tgcctatcca agccccagca gtccctctcc gaatgcctcg 
361 gccttctcag catcctgggc tttctgcagg agcactggga tgttgtcttt gccacctaca 
421 attccaacat cagcttcctc ctgtgtctca tgcattgcct tttgctactc aatgagagaa 
481 gttacccaga aggatttgga ttggagccca agcctagaat gtctacttat catcaagtct 
541 tcctttcccc aaatgaagac gtgaaagaaa aaagagaaga cttaccaagt ttgagtgatg 
601 tccaacacaa catccagaag acagtgcaga ctctctggca gcagctggtg gcacaaaggc 
661 agcagaccct ggaggatgcc ttcaagatcg atctctctgt gaaacctgga gagagggaag 
721 tgaagattga agaggtcaca ccgctctggg aggagacgat gctcaaggcc tggcagcatt 
781 acttagcatc tgagaagaag tcactggcaa gtcgttcaaa tgttgcacac cacagcaaag 
841 tcactttgtg gagtggaagc ctgtcctcag ccatgaagct gatgcccggg cggcaggcca 
901 aggaccctga gtgcaagaca gaggattttg tgtcatgtat agagaactac agaagaagag 
961 gacaagagct atatgcatct ttatacaaag accatgtgca aaggcgaaaa tgtggcaaca 
1021 tcaaggcagc caacgcctgg gccaggatcc aggagcagct ttttggggag ctgggcttgt 
1081 ggagccaggg ggaagaaacc aagccctgtt ccccatggga actcgactgg agagaaggac 
1141 cagctcgaat gaggaaacgc atcaaacgct tgtctccttt ggaggccctg agctcaggaa 
1201 ggcacaagga aagccaagac aaaaatgatc atatttctca aacaaatgct gaaaaccaag 
1261 atgaactgac actgagggag gctgagggcg agccggacga ggtgggggtg gactgcaccc 
1321 agctgacctt cttcccagcc ttacacgaaa gtctgcactc agaagacttc ttggaactgt 
1381 gtcgggaaag acaagttatt ttacaagagc ttcttgataa agaaaaggtg acgcagaagt 
1441 tctccctggt gattgtgcag ggccacctgg tgtcagaagg ggtcctgctt tttggccacc 
1501 aacacttcta catctgcgag aacttcacac tgtctcccac gggtgatgtc tactgtaccc 
1561 gtcactgctt atccaacatc agcgatccgt tcattttcaa cctgtgcagc aaagacaggt 
1621 ccactgacca ttactcgtgc cagtgccaca gctacgctga catgcgggag ctacggcagg 
1681 ctcgcttcct cctgcaggac atcgccctgg agatcttctt ccacaatgga tattccaagt 
1741 ttcttgtctt ctacaacaat gatcggagta aggcctttaa aagcttctgc tctttccaac 
1801 ccagcctgaa ggggaaagcc acctcggagg acaccctcaa tctaaggaga taccccggct 
1861 ctgacaggat catgctgcag aagtggcaga aaagggacat cagcaatttt gagtatctca 
1921 tgtacctcaa caccgcggct gggagaacct gcaatgacta catgcagtac ccagtgttcc 
1981 cctgggtcct cgcagactac acctcagaga cattgaactt ggcaaatccg aagattttcc 
2041 gggatctttc aaagcccatg ggggctcaga ccaaggaaag gaagctgaaa tttatccaga 
2101 ggtttaaaga agttgagaaa actgaaggag acatgactgt ccagtgccac tactacaccc 
2161 actactcctc ggccatcatc gtggcctcct acctggtccg gatgccaccc ttcacccagg 
2221 ccttctgcgc tctgcagggc ggaagcttcg acgtggcaga cagaatgttc cacagtgtga 
2281 agagcacgtg ggagtcggcc tccagagaga acatgagtga cgtcagggag ctgaccccag 
2341 agttcttcta cctgcctgag ttcttaacca actgcaacgg ggtagagttc ggctgcatgc 
2401 aggacgggac tgtgctagga gacgtgcagc tccctccctg ggctgatggg gaccctcgga 
2461 aattcatcag cctgcacaga aaggccctgg aaagtgactt tgtcagtgcc aacctccacc 
252l*attggataga ccttattttt gggtacaagc agcaggggcc agccgcagtg gatgctgtta 
2581 atatcttcca cccctacttc tacggtgaca gaatggacct cagcagcatc actgaccccc 
2641 tcatcaaaag caccatcctg gggtttgtca gcaactttgg acaggtgccc aaacagctct 
2701 ttaccaaacc tcacccagcc aggactgcag cagggaagcc tctgcctgga aaggatgtct 
2761 ccacccccgt gagcctgcct ggccacccac agcccttttt ctacagcctg cagtcgctga 
2821 ggccctccca ggtcacggtc aaagatatgt acctcttttc tctaggctca gagtccccca 
2881 aaggggccat tggccacatt gtctctactg agaagaccat tctggctgta gagaggaaca 
2941 aagtgctgct gcctcctctc tggaacagga ccttcagctg gggctttgat gacttcagct 
3001 gctgcttggg gagctacggc tccgacaagg tcctgatgac attcgagaac ctggctgcct 
3061 ggggccgctg tctgtgcgcc gtgtgcccat ccccaacaac gattgtcacc tctgggacca 
3121 gcactgtggt gtgtgtgtgg gagctcagca tgaccaaagg ccgcccgagg ggcttgcgcc 
3181 tccggcaggc cttgtatgga cacacacagg ctgtcacgtg cctggcagcg tcagtcacct 
3241 tcagcctcct ggtgagcggc tcccaggact gcacctgtat cctgtgggat ctggaccacc 
3301 tcacccacgt gacccgcctg cccgcccatc gggaaggcat ctcagccatc accatcagtg 
3361 acgtctcagg caccattgtc tcctgtgcgg gagcacactt gtccctgtgg aatgtcaatg 
3421 gacagcccct ggccagcatc accacagcct ggggcccaga aggagccata acctgttgct 
3481 gcctgatgga gggcccagca tgggacacaa gccagatcat catcaccggg agtcaagacg 
3541 gcatggtccg ggtttggaag actgaggatg tgaagatgtc tgttcctgga cggccagcag 
3601 gagaggagcc cctggctcag cctccaagcc caagaggcca caagtgggag aagaacctgg 
3661 ccttgagtcg agagctggac gttagcattg ctttgacagg gaagcccagc aaaaccagcc 
3721 ccgcagtgac tgctctggcc gtgtccagaa accacaccaa actcctggtt ggtgatgaga 
3781 gggggagaat attctgctgg tctgcagatg ggtaggaaga gagaggcagc agaggctctg 
3841 gcacaacagt gccaggctga gggtggcaga ggtgactggg gcctgagctc tgcctacaga 
3901 agaaaccccc agggcctcct tccccacagt tctcaaggaa gggcctctgg caatcacagc 
3961 tctgcagccc aaccctctcc atggccgatg ggacttctat gaaaaggatg agcacacaca 
4021 ctcggagggc tgagcagcac gctggaaact gtgacttggt gatgcccagc tgcacacgaa 
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4081 attacacatg actcacctta ttaagggcta ttgcactgaa aaaaaaaaaa agatgggtcg 
4141 cttactggaa attattgtat tgtctttatt ttattaaagc aactatgttt t 



RDFQSEVLLSAMELFHMTSGGDAAMFRDGKEPQPSAEAAAAPSLANISCF 

TQKLVEKLYSGMFSADPRHILLFILEHIMWIETASSQRDTVLSTLYSSL 

NKVILYCLSKPQQSLSECLGLLS ILGFLQEHWDVVFATYNSNI S FLLCLM 

HCLLLLNERSYPEGFGLEPKPRMSTYHQVFLSPNEDVKEKREDLPSLSDV 

QHNIQKTVQTLWQQLVAQRQQTLEDAFKIDLSVKPGEREVKIEEVTPLWE 

ETMLKAWQHYLASEKKSLASRSNVAHHSKVTLWSGSLS SAMKLMPGRQAK 

D PECKTED FVS C I ENYRRRGQELYAS LYKDHVQRRKCGN I KAANAWARI Q 

EQLFGELGLWSQGEETKPCSPWELDWREGPARMRKRIKRIjSPLEALSSGR 

HKESQDKNDHISQTNAENQDELTLREAEGEPDEVGVDCTQLTFFPALHES 

LHSEDFLELCRERQVILQELLDKEKVTQKFSLVIVQGHLVSEGVLLFGHQ 

HFYICENETLSPTGDVYCTRHCLSNISDPFIFNLCSKDRSTDHYSCQCHS 

YADMRELRQARFLLQDI ALEI FFHNGYSKFLVFYNNDRSKAFKS FCSFQP 

SLKGKATSEDTLNLRRYPGSDRIMLQKWQK3U)ISNFEYLM^ 

NDYMQYPVFPWVLADYTSETIiNIANPKIFRDLSKPMGAQTKERKLKFIQR 

FKEVEKTEGDMTVQCHYYTHYSSAI I VAS YLVRMP PFTQAFCALQGGS FD 

VADRMFHSVKSTWESASRENMSDVRELTPEFFYLPEFLTNCNGVEFGCMQ 

DGTVLGDVQLPPWADGDPRKF I SLHRKALESDFVS ANLHHWIDLI FGYKQ 

QGPAAVDAVNIFHPYFYGDRMDLSS ITDPLI KSTTLGFVSNFGQVPKQLF 

TKPHPARTAAGKPLPGKDVSTPVSLPGHPQPFFYSLQSLRPSQVTVKDMY 

LFSLGSESPKGAIGHIVSTEKTILAVERNKVLLPPLWNRTFSWGFDDFSC 

CLGSYGSDKVLMTFENIAAWGRCL^VCPSPTTIVTSGTST^ 

TKGRPRGLRLRQALYGHTQAVTCLAASVTFSLLVSGSQDCTCILWDLDHL 

THVTRLPAHREGI SAITI SDVSGTIVSCAGAHLSLWNVNGQPLAS ITTAW 

GPEGAITCCCLMEGPAWDTSQI I ITGSQDGMVRVWKTEDVKMSVPGRPAG 

EEPLAQP PS PRGHKWEKNLALSRELDVS I ALTGKPS KTS PAVTALAVSRN 

HTKLLVGDERGRI FC WS ADG 



MLQKWQKRD I SNFEYLMYLNTAAGRTQTOYMQYPVFPWVLADYTSETLNL 
ANPKI FRDLSKPMGAQTKERKLKFIQRFKEVEKTEGDMWQCHYYTHYS S 
AI I VAS YLVRM P PFTQAFCALQGGS FDVADRMFHS VKSTWES AS RENMS D 
VRELT PE F FYL PE FLTNCNGVE FGCMQDGTVLGDVQL P P WADGD PRKF I S 
LHRKALESDFVSANLHHWIDLIFGYKQQGPAAVDAVNIFHPYFYGDRMDL 
SSITDPLIKSTILGFVSNFGQVPKQLFTKPHPARTAAGKPLPGKDVSTPV 
SLPGHPQPFFYSLQSLRPSQVTVKDMYLFSLGSESPKGAIGHIVSTEKTI 
LAVERNKVLLPPLWNRTFSWGFDDFSCCLGSYGSDKVW4TFENLAAWGRC 
LCAVCPS PTT I VTSGTSTWCVWELSMTKGRPRGLRLRQALYGHTQAVTC 
LAAS VTFSLLVSGSQDCTC ILWDLDHLTHVTRLPAHREGI SAITI SDVSG 
TI VSCAGAHLSLWNVNGQPLAS ITTAWGPEGAITCCCLMEGPAWDTSQI I 
ITGSQDGKVRWKTEDVKMSVPGRPAGEEPLAQPPSPRGHKWEKNLALSR 
ELDVSIALTGKPSKTSPAVTALAVSRNHTKLLVGDERGRIFCWSADG 



Lyl484. Protein sequence 



(long) L5£&SO WD . 10^41) 
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a. TMpred Report for Ly1484 Long 

Date: 



RDFQSEVLLSAMELFHMTSGGDAAMFRDGKEPQPSAEAAAAPSLANISCF 
TQKLVEKLYSGMFSADPRHILLFILEHIMVVIETASSQRDTVLSTLYSSL 
NKVILYCLSKPQQSLSECLGLLSILGFIiQEHWDWFATYNSNISFLLCLM 
HCLLLLNERSYPEGFGLEPKPRMSTYHQVFLSPNEDVKEKREDLPSLSDV 
QHNIQKTVQTLWQQLVAQRQQTLEDAFKIDLSVKPGEREVKIEEVTPLWE 
ETMLKAWQHYLAS EKKS LAS RSNVAHHS KVTLWS G S L S S AMKLMPGRQ AK 
DPECKTEDFVSCIENYRRRGQELYASLYKDHVQRRKCGNIKAANAWARIQ 
EQLFGELGLWSQGEETKPCSPWELDWREGPARMRKRIKRLSPLEALSSGR 
HKESQDKNDHISQTNAENQDELTLREAEGEPDEVGVDCTQLTFFPALHES 
IiHSEDFLELCRERQVIIiQELLDKEKVTQKFSLVIVQGHLVSEGVLIiFGHQ 
HFYICENFTLSPTGDVYCTRHCLSNISDPFIFNLCSKDRSTDHYSCQCHS 
YADMRELRQARFLLQDIALEIFFHNGYSKFLVFYNNDRSKAFKSFCSFQP 
S LKGKATS EDTLNLRRYPGSDRIMLQKWQKRDI SNFEYLMYLNTAAGRTC 
NDYMQYPVFPWVLADYTSETLNLANPKIFRDLSKPMGAQTKERKLKFIQR 
FKEVEKT E GDMTVQCHYYTHYS S AI I VAS YLVRMP P FTQAFCALQGGS FD 
VADRMFHSVKSTWESASRENMSDVRELTPEFFYLPEFLTNCNGVEFGCMQ 
DGTVLGDVQLPPWADGDPRKFISLHRKALESDFVSANLHHWIDLIFGYKQ 
QGPAAVDAVNIFHPYFYGDRMDLSSITDPLIKSTILGFVSNFGQVPKQLF 
TKPHPARTAAGKPLPGia)VSTPVSLPGHPQPFFYSLQSLRPSQVTVKDMY 
LFSLGSESPKGAIGHIVSTEKTILAVERNKVLLPPLWNRTFSWGFDDFSC 
CLGSYGSDKVLMTFENLAAWGRCLCAVCP S PTTI VTSGTSTWCVWELSM 
TKGRPRGLRLRQALYGHTQAVTCLAASVTFSLLVSGSQDCTCILWDLDHL 
THVTRLPAHREGISAITISDVSGTIVSCAGAHLSLWNVNGQPIASITTAW 
GPEGAITCCCLMEGPAWDTSQI I ITGSQDGMVRVWKTEDVKMSVPGRPAG 
EEPLAQP PS PRGHKWEKNLALSRELDVS I ALTGKPSKTS P AVTALAVS RN 
HTKLL VGDERGRI FCWS ADG ^5£ QlDKi0 ; 

Black 5= intracellular , Red = Transmembrane, Blue ■ Extracellular 

Iiyl484 Long has 1269 amino acids and 5 Transmembrane Domains 
Transmembrane Domain 1: 63 - 84 Score: 1.36675 
Transmembrane Domain 2: 118 - 139 Score: 1.38695 
Transmembrane Domain 3: 480 - 501 Score: 1.36185 
Transmembrane Domain 4: 562 - 583 Score: 1.31785 
Transmembrane Domain 5: 725 - 746 Score: 1.3521 



b. TMpred Report for Ly1484 (short) 



MLQKWQKRD I SNF EYLMYLNTAAGRTCNDYMQ YP VF P WVLAD YT S ETLNL 
ANPKIFRDLSKPMGAQTKERKLKFIQRFKEVEKTEGDMTVQCHYYTHYSS 
AIIVASYLVRMPP FTQAFCALQGGS FDVADRMFHSVKSTWE S AS RENMSD 
VRELTPEFFYLPEFLTNCNGVEFGCMQDGTVLGDVQLPPWADGDPRKFIS 
LHRKALESDFVSANLHHWIDLIFGYKQQGPAAVDAVNIFHPYFYGDRMDL 
S S ITDPLIKSTILGFVSNFGQVPKQLFTKPHPARTAAGKPLPGKDVSTPV 
SLPGHPQPFFYSLQSLRPSQVTVKDMYLFSLGSESPKGAIGHIVSTEKTI 
IJVVERNKVLLPPLWNRTFSWGFDDFSCCLGSYGSDKVLMTFENLAAWGRC 
LCAVCP S PTTIVTSGTSTVVCVWELSMTKGRPRGLRLRQALYGHTQAVTC 
LAASVTFSLLVSGSQDCTCILWDLDHLTHVTRLPAHREGISAITISDVSG 
TIVSCAGAHLSLWNVNGQPLASITTAWGPEGAITCCCLMEGPAWDTSQII 
ITGSQDGMVRVWKTEDVKMSVPGRPAGEEPLAQPPSPRGHKWEKNLALSR 

ELDVS I ALTGKPSKTS PAVTALAVS RNHTKLLVGDERGRX FCWS ADG [<>£Q JO t^O V4V^ 
Black m intracellular. Red = Transmembrane, Blue = Extracellular 
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Lyl484 has 646 amino acids and 1 Transmembrane Domains 

Transmembrane Domain 1: 102 - 123 Score: 1.3521 
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Table2a: Lyl484 long MHC class I binding peptides 



HLA A2: 

RANKING 

1 

2 

3 

4 

5 

6 

7 

8 

9 
10 
11 
12 
13 
14 
15 
16 
17 
18 
19 
20 

HLA A3 

RANKING 

1 
2 
3 
4 
5 
6 
7 
6 
9 
10 
11 
12 
13 
14 
15 
16 
17 
18 
19 
20 



SEQUENCE 

FLLCLMHCL 
YLPEFLTNC 
KLYSGMFSA 
FILEHIMW 
TLYSSLNKV 
FLQEHWDW 
ILWDLDHLT 
ALQGGSFDV 
MQYPVFPWV 
ALYGHTQAV 
YLVRMPPFT 
CLMHCLLLL 
GLLSILGFL 
VLMTFENLA 
ILAVERNKV 
YLASEKKSL 
ELTPEFFYL 
YLMYLNTAA 
ILLFILEHI 
LLQDIALEI 



SEQUENCE 

KLMPGRQAK 

VLLPPLWNR 

CLGSYGSDK 

KVILYCLSK 

CVWELSMTK 

LLFGHQHFY 

PLWEETMLK 

FLSPNEDVK 

SLYKDHVQR 

FLVFYNNDR 

KLYSGMFSA 

GLRLRQALY 

NLANPKIFR 

RVWKTEDVK 

QVPKQLFTK 

RMPPFTQAF 

KLKFIQRFK 

RIMLQKWQK 

VILQELLDK 

IMLQKWQKR 



STARTING POSITION 

145 
783 

57 

73 

95 
127 
1093 
743 
654 
1063 
730 
148 
120 
1010 
973 
260 
776 
638 

70 
563 



STARTING POSITION 



THEORETICAL SCORE 

836.3 
818.9 
742.3 
629.3 
511.9 

448 
431.1 
403.4 
400.9 
222.6 
188.5 
181.8 

130 
118.7 
118.2 
98.27 
97.11 
84,. 56 
73.96 
72.72 



THEORETICAL SCORE 



292 


135 


981 


60.75 


1001 


60 


102 


54 


1044 


30 


495 


30 


247 


30 


180 


30 


326 


20 


580 


18 


57 


13.5 


1057 


12 


672 


12 


1183 


10 


894 


9 


733 


9 


694 


9 


622 


9 


465 


9 


623 


6 



HLA A24 



RANKING SEQUENCE STARTING POSITION THEORETICAL SCORE 

1 QYPVFPWVL 655 300 

2 FYICENFTL 502 300 

3 TYNSNISFL 138 300 

4 NYRRRGQEL 315 264 

5 DYTSETLNL 665 200 

6 SYLVRMPPF 729 150 

7 RYPGSDRIM 616 75 

8 RFLLQDIAL 561 €0 

9 LYSSLNKVT 96 60 

10 YYTHYSSAI 717 50 

11 FFPALHESL 443 36 

12 NFGQVPKQL 891 28 

13 SYGSDKVU4 1004 25 

14 GFDDFSCCL 994 24 

15 EFFYLPEFL 780 24 

16 YFYGDRMDL 865 2C 

17 SFCSFQPSL 594 2C 

18 RGLRLRQAL 1056 17 .2€ 

19 RGQELYASL 319 17.28 

20 KNLALSREL 1217 IS. 84 
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RANKING 

1 
2 
3 
4 
5 
6 
7 
8 
9 
10 
11 
12 
13 
14 
IS 
16 
17 
18 
19 
20 



short MHC class I binding pe£ 

{^56QTDrios^o^ 10 ^ - nodus' 



CID1096 Table 2b.Lyl484 short MHC class I binding pei 
A2 



SEQUENCE 

YLPEFLTNC 

ILWDLDHLT 

ALQGGSFDV 

MQYPVFPWV 

ALYGHTQAV 

YLVRMPPFT 

VLMTFENLA 

ILAVERNKV 

ELTPEFFYL 

YLMYLNTAA 

VLADYTSET 

RMFHSVKST 

SNFEYLMYL 

KVLMTFENL 

LLPPLWNRT 

QLFTKPHPA 

GAHLSLWNV 

WDLDHLTHV 

FISLHRKAL 

CILWDLDHL 



POSITION 


THEORETICAL SCORE 


160 


818.9 


470 


431.1 


120 


403.4 


31 


400.9 


440 


222.6 


107 


188.5 


387 


118.7 


350 


118.2 


153 


97.11 


15 


84.56 


39 


51.94 


131 


45.8 


11 


26.76 


386 


22.54 


359 


21.54 


275 


18.38 


507 


16.66 


472 


IS. 5 


198 


13.51 


469 


12.25 



HLA A3 

RANKING 

1 
2 
3 
4 
5 
6 
7 
8 
9 
10 
11 
12 
13 
14 
15 
16 
17 
18 
19 
20 



SEQUENCE 

VLLPPLWNR 
CLGSYGSDK 
CVWELSMTK 
GLRLRQALY 
NLANPKIFR 
RVWKTEDVK 
QVPKQLFTK 
RMPPFTQAF 
KLKFIQRFK 
TILGFVSNF 
SLPGHPQPF 
WIDLIFGYK 
NLHHWIDLI 
YMQYPVFPW 
ELTPEFFYL 
ALYGHTQAV 
YLPEFLTNC 
IIVASYLVR 
DMTVQCHYY 
QLFTKPHPA 



STARTING POSITION 

358 
378 
421 
434 

49 
560 
271 
110 

71 
261 
301 
218 
214 

30 
153 
440 
160 
102 

87 
275 



THEORETICAL SCORE 

60.75 
60 
30 
12 
12 
10 
9 
9 
9 

4.05 
3 

2.7 
2.7 
2.7 

2.43 
1.5 

1.35 
1.2 
1.2 
1 



HLA A24 

RANKING 

1 

2 

3 

4 

5 

6 

7 

8 

9 
10 
11 
12 
13 
14 
15 
16 
17 
18 
19 
20 



SEQUENCE 

QYPVFPWVL 

DYTSETLNL 

SYLVRMPPF 

YYTHYSSAI 

NFGQVPKQL 

SYGSDKVLM 

GFDDFSCCL 

EFFYLPEFL 

YFYGDRMDL 

RGLRLRQAL 

KNLALSREL 

KVLMTFENL 

TFSWGFDDF 

FYLPEFLTN 

ANPKIFRDL 

SFDVADRMF 

KTSPAVTAL 

KGRPRGLRL 

RNKVLLPPL 

EYLMYLNTA 



STARTING POSITION 

32 

42 
106 

94 
268 
381 
371 
157 
242 
433 
594 
386 
367 
159 
51 
125 
614 
429 
355 
' 14 



THEORETICAL SCORE 

300 
200 
150 
50 
28 
25 
24 
24 
20 
17.28 
15.84 
14.4 
12 
10.8 
10.08 
10 
9.6 
9.6 
9.6 
9 
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ANALYSIS RESULTS OF THE PROGRAM TSITES. 
**************************************** 

These are the results of the analysis of the file LY1484~1.TXT 
Beginning with residue: 1 and ending with residue: 1270 
AMPHI Window size: 11 

A - AMPHI mid points of blocks. 

R - Residues matching the Rothbard/Taylor motif. 

D - Residues matching the IAd motif. 

d - Residues matching the IEd motif. 

5 10 15 20 25 30 35 40 45 50 55 60 65 70 75 
RDFQSEVLLS AMELFHMTSGGDAAMFRDGKEPQPS AEAAAAPSLANI S CFTQKLVEKLYSGMFSADPRH I LLFI L 

AAAAA AAAA. AAA AAAAAAAAAAA 

* ! ! ! iRRRRR. .RRRRRRR RRRRRRRRRRRR 

DDDDDD 

80 85 90 95 100 105 110 115 120 125 130 135 140 145 150 
EHIMWI ETASSQRDTVLSTLYSSLNKVILYCLSKPQQSLS ECLGLLS IIX3FI£EHWDWFATYNSN IS FLLCLM 

AAAAAAAA AAAAAAAAAAAAAAAAAAAAAAAA 

WV.Y. IrRRR*. . .RRRRR RRRR- .VBRB. V " 

DDDDDD DDDDDDD 

155 160 165 170 175 180 185 190 195 200 205 210 215 220 225 
HCLLLLNERSYPEGFGLEPKPRMSTYHQVFLSPNEDVK^ 

.............. .... . AAAAA . AAAAAAAAAAAAA . ........... 

RRRR R 



230 235 240 245 250 255 260 265 270 275 280 285 290 295 300 
AFKI DLSVKPGEREVKIEEVTPLWEETMLKAWQHYLASEKKSLAS RSNVAHHSKVTLWSGSLSS AMKI^IPGRQAK 

AAAA. . .AAAAAAAA AAAAAAAA 

rrr RRRRRRRRRRRRR RRRRR 



.dddd. 



305 310 315 320 325 330 335 340 345 350 355 360 365 370 375 
DP EC KTEDFVS CI ENYRRRGQ ELYAS LYKDHVQRRKCGN I KAANAWAR IQEQL FG E LGLW S QGEETKPC S PWELD 

AAAAAAA AAAAAA AAAAAAAAAAAAAAA AAA 

[ .RRRR. . RRRRR RRRR ****** 



380 385 390 395 400 405 410 415 420 425 430 435 440 445 450 
WREGPARMRKRI KRLS PLEALS SGRHKES QDKNDH I S QTNAENQDELTLREAEGE PDEVG VDCTQLTF FPALHE S 
AAAAAAAAAAAAAAAAAAA AAAA. AAAA. 



.RRRR RRRRR. . . 

\ \ ~ . . . -DDDDDD DDDDDD. 

^dddd 



455 460 465 470 475 480 485 490 495 500 505 510 515 520 525 
LHSEDFLELCRERQVII^ELLDKSKVTQKFSLVIVQGHLVSEGVLLFG 

AAAAA AAAA 

!1!.RRRR" RRRR.. RRRR RRRR 



530 535 540 545 550 555 560 565 570 575 580 585 590 595 600 
ISDPFIFNLCSKDRSTDHYSCQCHSYADMRELRQARFLLQDIALEI FFHNGYSKFLVFYNNDRSKAFKS FCSFQP 

A. AAAAAA AAAAAAAAAAA ''^ MJi ' ' ' 

. . RRRRRRRRRRRRRR RRRR 



605 610 615 620 625 630 635 640 645 650 655 660 665 670 675 
SLKGKATSEDTIJ^I^RYPGSDRIMLQKWQKRDISNFEYLMYLNTAAG 

AAAAA AAA AAAAAAAAAA 

! ! [ IrRRrIrRRR RRRRR RRRR RRRRRRRR. . . 

.DDDDDD .* 

680 685 690 695 700 705 710 715 720 725 730 735 740 745 750 
PKI FRDLSKPMGAC/TKERKLKFIQRFKEVEKTEGIMIVQCHYYTHYSSAI I VAS YLVRMP PFTQAFCALQGGS FD 

AAAAAAAAAAAA AAAAAAAAAAAA AAAAA A 

.RRRR; RRRR RRRRR. RRRRR R 
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dddd 

755 760 765 770 775 780 785 790 795 800 805 810 815 820 825 
VADRMFHSVKSTWESASRENMSDWELTPEFFYLP^ 

AAAAAAAAAAA.AA.AAAA. .AAAAAA AAAAAA AAA . . . - - 

rrrrrrr RRRRR RRRRR RRR** ***** - 



830 835 840 845 850 855 860 865 870 875 880 885 890 895 . 900 
IUCALESDFVSANLHHWIDLI FGYKXX^PAAVDAVNIFHP YFYGDRMDLSS I TDPLI KSTI I/3FVSNFGQVPKQIjF 
AAAAAAAAA AAAAAAAAAAAAA AAAAA. .AAAAAAAAAAAAAAAAAA 

.RRRR. RRRR RRRRRRRR **** 'AAA^**** 

DDDDDD 

905 910 915 920 925 930 935 940 945 950 955 960 965 970 975 
TKPHPARTAAGKPLPGKDVSTPVSLPGHP^ 

AAAAAAAAA AAAAAAA 

RRRRR RRRRRRRR. 



. DDDDDD DDDDDD DDDDDD . 



980 985 990 995 1000 1005 1010 1015 1020 1025 1030 1035 1040 1045 1050 
VERNKVLLPPLWNRTFSWGFDDFSCCLGSYGSDKVIJ4TFE 

AAAAAAAAAAAAAAA 

RRRR RRRRR 

DDDDDDDDDDD DDD 



1055 1060 1065 1070 107S 1080 1085 1090 1095 1100 1105 1110 1115 1120 1125 
TKGRPRGLRLRQALYGHTQAVTCLAAS VTFS 1 

AAAAAAAAAAA AAA. . .AAAA 

•**•♦••**••*•*■**'' RRRRRRR RRR 

DDE)! 1 ! DDDDDD ! Hill ^DDDDDDDDDDD! ! 1 \ \ \ \\ \ \ \ \ \ \ \ \ \ DDDDDD. . DDDDD 

.dddddddddd 

1130 1135 1140 1145 1150 1155 1160 1165 1170 1175 1180 1185 1190 1195 1200 
VS CAGAHLS LWNVNGQPLAS ITTAWGPEGAITCCCLMEGPAWDTSQI 1 1 TGSQDGMVRVWKTEDVKMS VPGRPAG 

^ AAA * AAAAAAAA 

PR .RRRR RRRRRRR 

D?l ! \ \\ \ \ W \ \ \ \ \ .DDDDDDD DDDDDD. .DDDDDD 



1205 1210 1215 1220 1225 1230 1235 1240 1245 1250 1255 1260 1265 1270 1275 
EEPLAQPPS PRGHKWEKNLALSRELDVS IALTGKPS KTS PAVTALAVSRNHTKLLVGDERGRI fcws ADG 



RRRRR. 

! DDDDDDDD DDDDDDD 
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ANALYSIS RESULTS OF THE PROGRAM TSITES. 
**************************************** 

These are the results of the analysis of the file ~> LY1484~2.TXT 
Beginning with residue: 1 and ending with residue: 647 
AMPHI Window size: 11 

A - AMPHI mid points of blocks. 

R - Residues matching the Rothbard/ Taylor motif. 
D - Residues matching the I Ad motif, 
d - Residues matching the IEd motif. 



5 10 15 20 25 30 35 40 45 50 55 60 65 70 75 
MWJKWQKRDISNFEYLMYI^TTAAGRTCNDY ETLNLANPKI FRDLS KPMGAQTKERKLKFI 

AAA AAAAAAAAAA AAAAAAAAAAAA AAA 

RRRR RRRRRRRR RRRR RRR 



.dddd. 



80 85 90 95 100 105 110 115 120 125 130 135 140 145 150 
QRFKEVEKTEGDMTVQOIYYTHYSSAIIVAS^ 

AAAAAAAAA AAAAA AAAAAAAAAAAA. AA. AAAA. .AA 

R RRRRR. RRRRR • - • .RRRRRRRR v • • 

DDDDDDDDDDD 



155 160 165 ' 170 175 180 185 190 195 200 205 210 215 220 225 
VRELTPEFFYLPEFLTNCNGVEFGCMQDGTVIX^VQLPPWA 

AAAA AAAAAA AAA AAAAAAAAA. . . 

rrrrr rrrrr RRRRR RRRR- • .RRRR* RRRR* - . .RRRRRRRR- 



230 235 240 245 250 255 260 265 270 275 280 285 290 295 300 
KQQGPAAVDAVNI FHPYFYGDRMDLS S I TDPL IKSTI I/3FVSNFGQVPKQLFTKPHPARTAAGKPLPGKDVSTP V 

AAAAAAAAAAAAA AAAAA. . AAAAAAAAAAAAAAAAAA AAAA 

RRRR RRRR RRRRR 

DDDDDD DDDDDD DDDDD 

305 310 315 320 325 330 335 340 345 350 355 360 365 370 375 
SLPGHPQPFF£SLQSLRPSQVTVKDMYLFSLGSESPKGAIGH^ 

AAAAA . * AAAAAAA 

RRRRRRRR RRRR 

D. ......... ... DDDDDD 

380 385 390 395 400 405 410 415 420 425 430 435 440 445 450 
SCCIXSSYGSDKVIircFENIJ^WGRCIX^VC^ 

AAAAAAAAAAAAAAA 

* * [\ \\ ] * * * DDDDDDDDDDD DDDDDD. . . .DDDDDD DDD 

..„,„......„ , dddddddddd 

45S 460 465 470 475 480 485 490 495 500 505 510 515 520 525 
LAASVTFSLLVSGSQDCTCI LWDIJDHLTHVTRLPAHREGI SAITI SDVSGTI VSCAGAHLSLWNVNGQPIiAS ITT 

AAAAAAAAAAA AAA. . .AAAAA AAA. 

RRRRRRR RRRRR 

DDDDDDDd! I \ . . . DDDDDD. .DDDDDD DDDDDD 

530 535 540 545 550 555 560 565 570 575 580 585 590 595 600 
AWGP EGAITCCCLMEGPAWDTSQ 1 1 ITGSQDGMVRVWKTEDVKMS VPGRPAGEEPLAQPPS PRGHKWEKNLALS R 

AAAAAAAA 

' [RRRR RRRRRRR 

£) DDDDDD . . DDDDDD 

605 610 615 620 625 630 635 640 645 650 655 660 665 670 675 
EI»D VS I ALTGKP S KTS PAVTALAVS RNHTKLLVGDERGR I FCWS ADG 



RRRRR. 

DDDDDDDD DDDDDDD 

ddddd 
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1 gtcttctccc tgtgtcttca cgtggtcccc gctgtctgtg cgttgtgccc tgatctcctc 
61 ttcttataag gaccccaatc atattggatt agagcccatc cacaggacct aaatttttcc 
121 cttaactctt ttcctgtttg ccccaagaac actcaccagc agctcttgcg gctgcagcgt 
181 ttaccccaaa gtaatttggt cacaaaacat ctccctttta ttattttcac tttgctttag 
241 tgtatcaact ttggaaacaa aagacatcat tctacgtata agacatcatt ctaggtatag 
301 cgttctgtct ttagtagtgg catttccatt tagaaaatat agtaattctc gatcgctgaa 
361 aatgtcaaat ccgagaaaac atagcattcc tatgtgatca ttctcaaaga gttgttggcc 
421 aaagattcat ttgattaatc ctatttttct gaaatagacg attctgatga ttgagatgat 
481 tttgatgtta gttctgttta cacataactc caagaacagt ttttaaattt tactttcaca 
541 ttgaaaatca gtcaggttgg tttatgtaaa attaaatgag ctctggccaa gcacggtggc 
601 tcatgcctat aatcccagca ctttgggagg ctgaggtgga cagatcactt gaggtcagga 
661 gtttgacacc agcctgggca gcatggcgaa accccgtctc tacaaaaaat acatatatat 
721 tagccagatg tggtggtgca tgcctgtggt cccagcttct caggaagctg aggcaggaga 
781 atcatttgaa cttgaggggc agaggttgca gtgagccaag attgtgccat ggaaccccaa 
841 cctgggcaac agagcaagac tctgtctcaa aaaaaaaaaa aaatgctggc agtgggctgc 
901 actttttttt ttcctaaaca ggaaaagggt taattatttc tttaaagaac ctagcaccaa 
961 atacagtcac attcggaggt actgggggtt gggacttgaa catatgaatt ttggaggggc 
1021 acaaggcatc ccatgacaca tgtgggaacc ggtataatct ctacagagac tagaatttag 
1081 atttaacagc ttgaaacagc ttcaagttat tggtaacata caaaacaaaa acaattcaaa 
1141 gccctttaca ttcaaattat tgaataaaaa ataactattt ctacaatgta taaaaggtat 
1201 gctatgttta ttagtcttac ttgtagctaa tctactttta tgtttcctta cagaaaattc 
1261 ttgacatttt atgtttccgt gtaagagatt gtacagaagg caaatttttg tgttatgcaa 
1321 taataatact ctaatacaaa ttctgttatg aacttcattg gcactatatt taaatgtcat 
1381 agcaattatt tttatgtggt attttcaaat aatctagatt tcatgcagca ttataaaact 
1441 aactggggct ggggacactg gctcatgcct gtaatctcaa cactttggga ggccaaggcg 
1501 ggcagactgc ttaagcccag gagtttgaga ccatcctggg caacatggca aaaccccgtc 
1561 tctacaaaaa atacaaaaca aaacaaaatt agccagacat ggtggtgcac gcctgtggtc 
1621 ccagctactc aggaggctga cacaggagga tcacttgaac acgggaggca gaggttgcag 
1681 tgagccagga ttgtgccact gcattccagc tgggtgacac agcgagaccc tgtctcaaaa 
1741 caaaacaaaa caaaacaaaa acaaaacaga aaacagtgtg agaagtgttc attacccacc 
1801 cacccccacc cccaaagctt tccaagctcc cgaccccata cagaggcaaa ggatgcactg 
1861 tgcaaagctg cctgcctggc cccaaacgtc cagtcaaacc caccactgcc cttgcatgac 
1921 ttctgacccc tccaccttca atctagaccc cacacctatc ccaccatcat ctcaagccaa 
1981 gggggacctt gaataaaggt tttcttcctg tccctgcacc tccagacaag agagtgcaat 
2041 ggtttcttag acagatcaga tcctgttaaa acatttaaca aggtcaagat aacttaaaat 
2101 agttaccaga caaaaataga tggactagta catggacagc agctaactga gacatgggat 
2161 gctacgtttc ttattcacac atcagatctt gacatcatat tagtaaaggt gtggggcggg 
2221 gacatgccat ggccagggtg gggtcgcagc tggccagact gagctggccc ttctgttgag 
2281 ttgaaccttg gggatatgca gccgctgtgc aatggtgcca cctgatggta ctttgtagaa 
2341 taaccactct cctagttttt cttcatcttt aagttcctaa agagagactg aaatttcaaa 
2401 ttggtttcct tgttgtctat tattgtcaaa tgaaaagtgc aaacgttggt actcacctct 
2461 cttgattttt atctgtctaa aaaatgaaat ctcagctata ttcactgata tattatggaa 
2521 agaacactgg acttaaggga tttgggttca gatcccattt tcggcaactt actagttata 
2581 ggatttggta aagttactca acatctgcat gcccatttta tgatctggag aatggaaata 
2641 ataggattct tgtgaggatt acaggaggtg atatatgcac atctgcgtcc ataaatcaag 
2701 cctgggtgtt gtggaggtgt gcgtcaccat catgcattct ctgaaggagg ctcagccagc 
2761 tctaccttta atggctggga tttacttgtc ttttggttcc tttccacctg tttgccagca 
2821 gaggctaagg ttgtgatgga ggccatctct cagcagatga gttgagacca tcctgggcaa 
2881 catggtcagg accacccttg ggcctctgtg cagggcagtg cccaccctga tgcctacgct 
2941 ggtcagattg agagcataga catggggcca tgacctccat aaacccccac ccagtgccac 
3001 atggcagccc aaggaaccct cttagccctg ggttcttcgt ctgtaaaacg aggctaacgt 
3061 aacaagggga gctacttagt aaaccaaaaa aaaaaaaact aaactacaat tcgtcttctg 
3121 ggatgttttg ttttgttttg ttttggagtg ttatttggct gctgttttac tctatattcc 
3181 cagagacaaa tgaagcgtaa gaggctgttc ttgcattact ataaaggaat acctgagact 
3241 gggtaattta taagaaaaga gaagttcttt tgtttttggt ttgttttgag acagagtttc 
3301 actcttattg cccaggctgg agtgcaatgg tgagatcttg actcactgca acctctgcct 
3361 cccaggttca agcaattctc ctgcctcagc ctcctgagta gctgggatta caggcatggg 
3421 ccaccatgcc cggctaattt cgtattttta gtagagacag agtttctcca tgttggtcag 
3481 actggtctca aactcccaac ctcaggggat ccgcctgcct tggcctccca aagtgctggg 
3541 attacgggca tgagccaccg tgcctggcta gaaaagaggt ttaattggct cacagttctc 
3601 caggctgtaa ggaagcatgg ccccaacatc tgctcatctt ctgaggaggc ctcaggaagc 
3661 ttatgatcat ggcagaaggt gaagggggag caggcacgcc acatggcaaa agcaggaaca 
3721 agagagacag ctggtggtgg ggggtgccac agactcttca atgaccaggt ctcacaagaa 
3781 ttcactcact atgatgagga cagcaccaag ccatgaggga tcggctccca tgatcccaac 
3841 acctcccacc agatcccacc tccatcactg gggatcacaa tgcaacctga gatttgggcg 
3901 aggacaaata tccaaactat atcaagcatc ttccttcccc tttctccctt cccctcctga 
3961 ttctggggag aagaggctgc tgtgtgtttc tgaagggctg agtgttctca cccaccccca 
4021 accccccaag taaagaactc ttatgtttca ggcccatggg gtgtggggag aggtggtccc 
4081 agaggagcag ggcgtgagat tggcagcttt agttgaagag gctgggatag ataggggggc 
4141 aggggtggtg gtctcagcaa atggtgagga gtgaggggga agggtgtcca aggctgctca 
4201 gggggtggag catgaatacc cagaggctga gcttcaactc atcagcactg ctcccgggga 
4261 ccacagtgac tgaagagtga acctctgagc agtgctcagg gcgtcctgag gcccggagag 
4321 ccagttctct gaatggtgca agaccctgtc aggtccacag gaccccagaa gccaggaagc 
43 81 acaacaaatt ggatttccgc ccacgctgag tagatgggag ctagaaaacc aaagaaggcc 
4441 aggtgtgttg gctcacgcct gtaatcccag cacttcggga ggccgaggcg ggcggatcac 
4501 ttgaagtcag gcatttgaga ccagcctggc caacatggtg aaaccctgtc tgtattaaaa 
4561 atacaaaaat tagctgggca tggtggcatg cacctgtaat cccaactact tgggtggctg 
4621 aggcatgaga atctctgaac ctgggaggca gagattgcag tgagctgaga ttgcaccact 
4681 gcactccagc ctgggcgaca gagtgagact ctgtcttaca aaaaaaaaaa aagaaaaaag 
4741 aaacaaagaa acaaagaaaa agaaaatcaa tggcccttgc gccagagtgt cttggaggga 
4801 gtccacactg acacgcccac ggggggagct gctgaaagca ttccccagtc cagtgctgcc 
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4861 ccgtgagcac cgccacctgt ttcccgcttt cctggccacc gtacatacat ttgttcatgt 
4921 ttatagactc tgaatataga gaccccacac aagaagagaa aggctcagaa aaacaatgtt 
4981 atgacctgct tcctgaggca ccctgcctgc aggaagtgct tttcacagca agaaattctg 
5041 actccaggac ccatattttt agcaggaaca tggtcagtca acacacatcc tctaggcctg 
5101 ccctggacca gatgaccagg aggaagcagc cgtgagtcac agggagaaag ccggtggctc 
5161 tgacagggac gcctgctgct ccgccgggga acagctttcc cctggtttca gttagaaagg 
5221 tcaggggacg cctgaccaca caaggctcac gggaactgtg gagggcggcc cctcaggcca 
5281 caatcagcag gtgcccacca tggcttctct cctggcaggt gaggacttca gtttttctcc 
5341 atggcttggc tcacctgttc tcttattctc agagccataa tcaccctcct cctgccccag 
5401 cctggagctc tctgagaccc tccactatag taaccagcac agcctagaga gggtaaatcc 
5461 tccggtgccc ccaacatccc accttcatgt tgcagcgagg cccaaaatgt cccagaagaa 
5521 aaacaaactc catcgcacac caaaaaccca ccaaaaaacg tgaaatcacc ttcataattt 
5581 gggaagtata aacatagttc tatttgttct gtaacgtatt tttcatgatc aggaaatgaa 
5641 gacaaagaaa ggctttggga attgttttat ggagaggaca ccatcttcca ggtcctgcca 
5701 gctcagggct cccatctttc agcccctcca gagctgagcc cggagggctg gggccgccct 
5761 ctccctgctc ctgagccaca gaagcccaag gctggccctg aagacctaac agcctcaggc 
5821 atttgcagcc cccttggcct aaaagggctg ttaagagagg tgcagacgtg ggccctgccc 
5881 attcaccctc agaggaggtc gcgacttcca ccccagcaag agtaccccta ggactgtgga 
5941 gtgcagagga cagcgaccgt tgactaggta cctggtgcag ggtcttgccc tccctccctg 
6001 tagcccgata tgggaagggc cacagtccaa gtccatgttc atgctgctcc tccaggaggc 
6061 cttcccaaac atcttctcga tgccgagtaa ctcacagatc ccatatagaa atagaccttg 
€121 gaagggatct ctcttcttgc aaatgaagaa actgaagacc agagatgtac ctcaaatcac 
6181 acagcgagta accggcagag ccaggaggct cctctgcccc caggccagag agggcttgca 
6241 atctggcttc cgagcagtgg ttcccaactc ccgggccatt acaatcacct gggatcccca 
6301 ccagtggaat gactctctgg gggtggtgct tggaaggctg gtggtggtga aagttcccct 
6361 ggtgattctc atgcagccag attggagaac cctttagatt caggcatgga ggttccacct 
6421 ggttccagat cacatggcca tttggtctca agaaagtcac agccctcctg agtctcactg 
6481 gcaaattgct acgtaataac acgcatcttc ttgataagga accgatataa ggatccagac 
6541 cctggacctc ctcaggaaaa cagagcacgc attataagat acaatcgaat aggagaagtt 
6601 tttcataagc ttggagggct atggttattg gattcaaaag aatagttttc tctcttgtca 
6661 cataccaatt tgaaaagtag taatacccat tctttaacag aagtgtattt ggggccagtt 
6721 gcggtggctc acacctgtaa catcaatact ttgggaggcc aaggtgggtg gatcacttga 
6781 ggtcaggagt ttgagaccag cctgtccatc acagtgaaac tccgtcttta ctaaaaaaac 
6841 aaaaattagc tgggtgtggt ggcgcatgcc tgtagtccca gctactaggg aggctgaggc 
6901 aggagaatca cttaaacctg ggaggtgggg gttgcagtaa gctgagatcg tgccactgca 
6961 ctccagcctc ggcgacagga gggcaaccct gtctcaaaaa aaaaaaaaaa aaaaaaagcc 
7021 tatttggaat cagtcccctc ccacggggac ctctttcagg aggtcaggcc acagcaccgt 
7081 ggcaggcatg tggtagcccc tccataaatg gctgagcgag tgcagttgaa gtgggagtct 
7141 ggtgttcttt gaatacatcc tgctgcattg gaagaagcct gtgcaggact agacagttgg 
7201 ccttctgtgg caggctcttg cctccaggcc agttgagtga gtgcagatgt cctgagagac 
7261 cagcagtgca gccacacccc atgggactgt gaggcgctct cacagtgaag accaagcgca 
7321 ggtagcactg gctttgtggt tcggagaaaa gtcgacactc atttccttgg ggaaaaaaag 
7381 tatttatcaa aggaagtctt tccctcttgg caaaaaaaac agagacaagt aatttcctcc 
7441 agcagtctga gctgggattt gggggcaaat aaccattaaa cagagacacc tgagggtgca 
7501 cagatggtca ctgtggcaaa ggtcggctga agaataatag gtaataggga gccactgtgg 
7561 gttcttgagt ggaggagggg catgataaag cagggctgga gagcagaggg agaacggtag 
7621 agcagaatgt cacaaggtgc tggggccctg actccctaag tgcaggaaac tgctgagtct 
7681 tgaggtcact tgagcctcct cctgcctctg tgatagctcc cacctgccac ccccacaggc 
7741 cacacggtcc aaatgtctca tcaactgcta gcatacgcta agcagaagaa agcgtgtggc 
7801 tgtcccattt tttatgcacc aagccgtgag gccagaggca ccacccctct gcttggagga 
7861 ttctgagggt ggtggtagtt gtcatctttc cagggaatgt gtggccttct agagcctaat 
7921 aatagtgata accataactg ccgcttatgg aggacactca tgagacggta ccgagctcga 
7961 attcgaatac atnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 
8041 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnacccaagg 
8101 gaaaaacagt gtgaagtcca gttttcagcc tgcccaatga agcctgagct acagcaacca 
8161 tggtaggatg aggaaccaat gtctgcccat tggcaggagc catagataca aacccaacca 
8221 cttgctccaa aaagtgctgc agctgcggct ttcacactgc actttgcatt ctcctttcag 
8281 ctctgttgcg gaggtagccg ctactaagct tactttacca gttactaatg tgccaaaaat 
8341 ggatccagcc acattctcca tgagcttccc acccccagga agtacccctg ttcagtagac 
8401 tccagaagca aaagacacct gccagaatct gttctcttcc tccacctccc acctcccctt 
8461 tgcatgatgg atacctgcac ccttctcgct cctggagtcg gacatatcca aaatcggtca 
8521 tttggggtac gggcctaata gaataagctt taatgtaaac ccaagtaaag catgtcttat 
8581 gcctcctgtg ccaatgaatt gaggaggcat ctacccaggc gaggctagcc aggtgcccca 
8641 cagtacccca cgggccactg tgttcctctt gattagaaaa gttcctaact tattgcaacc 
8701 cagggtttac atccccacct gactcaggcc ccttaaggca ggaaccttat cctgtttacc 
8761 aacaactctg ggtaatacat agcttttgct agtaacaggg gctactaaca gctgggcaca 
8821 gtggctcatg cctgtaatcg gagcactttg ggaggccgag gcaggtggat cacctgaggt 
8881 caggagttcg agaccagcct ggccaacatg gtgaaacccc atctctacta aaaatacaaa 
8941 aataagcggg gtatggtggc gcatgcctgt aattccagct actcaggagg ctgaggcagg 
9001 gaggattgct ttaacccggg aggcggaggt tgcagtgagc tgagatcaca tcactgcact 
9061 ccagcctggg tgacagagcg gagtctcaaa taacaaaaac aaaaagacac cttagtgtga 
9121 atttaagatg aagcactatc tcccccaagg atattggctt gagggatgca ggagggtgca 
9181 tgaagagaga agtacagggg gctgcaatcg ggacacgggg gacaccaggc tgaaaacagt 
9241 acgccttccc tcctcggtaa aggtcagtca accctggtgc tatttccatc ccccacccca 
9301 ttcccttttt ttttcttttt tccttttcca ttttcaggat tttcatgtca taactggttt 
9361 cttgataaca gcaatgtcac atgcctgaaa aaagtaagca cttggtcatt cagagaaaat 
9421 tacccaattt ttggtttccc cttgaacttt gttttcctgc tctgatgcca acctctcagc 
9481 catgccaccg ccctttagtg ccgagatcag aagatgaaaa gaaaggaaaa gcataaaatc 
9541 agttggatat ttttaaaagg ccaatggcat ctcttagata aatgggtttc acggggaaag 
9601 agagttagga ggccattaaa tttctttaac tctccctcac ccatcagcac aagcaaatta 
9661 aacaaagaaa aaaaggaagc caccagggca ataaattctt aacacaaacc aaaggccttt 
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9721 tctagttgct gtatcccctt tccctaaatg aaaaagtatc tcactaaccc tttcttcata 
9781 tttttcttca cttactttct tctaatcact ttaggatatt agagctgcta ctccaaagtg 
9841 agaaattgca ataaatgtgc ttaggcccac atttacataa acttttcttt ttgggggggt 
9901 agagtgtatt attatattca cagtgtaact attgctgttc aggggtgact tttcagacac 
9961 catgaattca acagcatcct cgaataagca tttttctttt cccaattgag acaagatttg 
10021 gaagttaaca aaccatttaa taatatggct tcattttgtt ggcatggtct cactcactca 
10081 cacacacaca cacacaaaaa acagtaggaa aacactttcc atatagtctt gctccccaaa 
10141 gggtgatgat tcgggtcagg ggacaccctg taagacaatg cgctgcatgt aacctccatt 
10201 tcaagccaaa gccaagacaa tgaagacggc tcccacaggg tgcacacctc cagcaagtga 
10261 atacagtagt agcagcagag catggtcgtg gaagttgggg aaaagcgaga aaaccaagtg 
10321 ggcaaaatta tgtcattttt gtatttttca aaatgaatgt tctttggact tacattagca 
10381 gctagaaatg aatgtaaatg ttctttcact tcagcaaggc attcttggaa ggatgtcatc 
10441 ccactgttaa taaaaataca ggcggggcac agtggctcat gcctgtaatc ccagcacttt 
10501 gagaggccga ggtgggcaga tcacctgagg tcaggagttt gagaacagcc tggacaacat 
10561 ggcgaaaccc tgtctctact aaaaatacaa aaaagtagcc gagcatggta gcgggtgcct 
10621 gtaatcccag ctacttggga ggctgaggca ggagaatcat ttgaaccagg gaggccgagg 
10681 ctgccgtaag ctgagattgc accattgcac tccagcctgg gcaacaagag caaaactctg 
10741 tctcaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaccacat agagggccac cccctaagaa 
10801 aaacttaaaa ccactgattt gtggtcaggt agcaacatca gggtggaaat tttaatgcaa 
10861 acatcatatt tggggtacaa aagtaaatta accaagatcc ccggagacta tcacattact 
10921 aaaaagggca gtgaaaacac attttcttga taaaatatat gacctaattt agccaatata 
10981 agcatggtta cattttcctt agtcctactt ctctatcatt tttaagacca tgtgacaggt 
11041 gtctcaggtg tcatctcaga tgtgccatgt tatacactgt gttctgtctg cagggcttca 
11101 tcatacttcc agaataagtt taatctacaa aaatggactt atgttaacac aaatatcaac 
11161 cacaaaacct taaggatgtt gttagctttg gctttttgaa ggaaaattct ccacatttat 
11221 gcaactccat tatctacgtg ggggagtctc cggagggtct ccaaaagctg accaaaccct 
11281 aaagtatttt ttctcctttt aatctgaaat cattcccgaa tatgacaagt caaattatca 
11341 cttggttttc cctgaatagg acaaccctgt attataacct tgaccctgac aaggactgct 
11401 cggttctaat gcctacactc ccacctcctt ggcagaaagt gccctggcag tattgactgt 
11461 aaacagagag gtagaaacaa agggatcttc agtaaaggca ccatctactg tctaaagttg 
11521 gaaagcacca atcaacaggt tcttcaggtg gagaggtgga ttagggaaga gctatgagga 
11581 cctgagatga gttctagatc aggactgatc ttgagccgtc cgaagagtgg ctggtgggag 
11641 caactctcaa agcagaggtt ttcaaacttg cctgcacatt agaatcacct gcggcgtctt 
11701 aaaaattcca ttgtccaggc tgcaacccag gcaagccgaa tccagatctc tgggagtggg 
11761 acacaggcac cagtattttt taaaactctc cagatgtggg gccaagtttg agaaccaatg 
11821 cccgcaaaag cattcccagc gagctttgag caacgttttg agcggcataa gcagagtcgg 
11881 gagggcagcg gccttagggt gaagaacaca caaagcaaat atacaaagtt tcaaaaatcc 
11941 tgtgtaacta actccaacat gacagccaag tacctgtcta gaggccaaac agtgatgcaa 
12001 cttggaaaag tcccggggag agctagaatg tgtgctccga gtgaggcggg ggctctctcg 
12061 gggatgacct ccctgggcgg cgcagtggtc tggaggatcg gggagagaaa ggtcacacgc 
12121 ccctttcttt ccaggggatg gtccggccga gcacgccgag gacttgtgta cgacaggata 
12181 ggtcagaaac tgcttcgtgt gagtctctcc cagctgtcac tagagccgga gggtccaggc 
12241 aggggtgcga aggcagaaag tccctcttca ctaaagggag gttctgcagg gcctcctccc 
12301 tcccaccccg ccccgaagcg ggttccccag ggatcgcccg catctgttct gcgcctgccc 
12361 tcggcggggc gcggcgcggg agtccgggtc gcatcttcga gccccaggac acgcgcggcg 
12421 cgggtcaccg agagtggacg cgcggactga gcacagtgca cctggctgag cccgaacccc 
12481 tgggggtccc gccgccctgg gcggccccta gcaccccgaa acccggctgc cggcgcacac 
12541 cccgccccgc tccagcccgg agctcactca ccgccgccgc cgcctcctcc cgcgtccagc 
12601 tcccgccgca gccagcagag cccggccgga gtgcaagggc cgcggcgccg ccgggaggcc 
12661 gagcgagccc gcgccccgcc ccgcagtcgc cgcccgccct cctgggagga ggggcagggg 
12721 ctggccgggg gcctggaccc gccggggctg ccggcgggcc caaccctccc ttcaatccgc 
127B1 ccgcggcccg tcgggacgaa cctacccagg aacccccgca ccacccgccc tcggctccgc 
12841 tgggcccggg gctgcgctta gacctcaccc cttcccggag ccgttcgtcc ttccacctcc 
12901 cctccgccgt caccgggcat ttttcttctc ccagcctcgg cctctccaaa ttcctggcct 
12961 gcaggaagcc ctccctattt cgcgctctcg actgcgccga gctcagttta cccctttccc 
13021 ttcctttccc tgcttagcct ttttccccct cactgtcaga aacttaaggg atagcgctct 
13081 tctgccaccg caaacccaca cctatttatt gagcacctac tgtgtctaga cactggtggg 
13141 gaggttatcc cacttgcgcg gtaacaagaa atgtctgtct ctgttggaga aacagtaaag 
13201 tgaggcgcct cgtaaggtca cacagctggc ggtagtaggg ctgggattcg aatacagttt 
13261 ttaggtcctt gtttagaccc accccacgat aaactgcggc acgcctctcg ctagctgtga 
13321 gaacttgggc aaaatgttat cagttctaaa ccccagttca ctcatccgta aaatgaaaag 
13381 ctggatactt caccagctat gaggattccc tgagaaaaca acaacctgta cttttttttt 
13441 tttttttttt ttttttggag atggagtctt gctgttttgc ccaggctgga gtgcagtagc 
13501 gcgatctcgg ctcattgcct ctgcctcccg ggttcaagcg attctcgtgc .ctcagcctcc 
13561 cgagtagctg ggactacagg tggccaccac tacgcgcggc taatttttgt gtttttagta 
13621 gagacggggt ttcaccacgt tggtcaggct ggtctcgaac tcctgacctt aagtaatcta 
13681 cccgcctcgg cctcccaaag tgctgggatt acagatgtga gccactgcgc cctgcccaac 
13741 ctgtacttaa tacatgtttg ctgacatatt gaatgaaata gacccttgaa aacttaacaa 
13801 ttcaaaacaa tataaaagga cttgaaaaag taataggttg gtgcaaaagt aattgcagtt 
13861 ttgctattac tttcaatggg aaaagccaca gttacttttg ctccgaccta ataatagtga 
13921 ttcattgcca ggaaagaaag aaaaaaaaaa aaagatgcta cttatcccaa gtgggcagca 
13981 tggtcaaggg agtctttttg gaggaagcgg tattgcacta ttccagagct agcaggtcta 
14041 ggttggggaa gacagccaag gtgatggggt gtgtgaacag tgcccagagg gtggaccagc 
14101 tttcctgagc agaggctgcc tggggcagga cacacatgct agaaaatggg atgaggcata 
14161 ctagaaaggt ggatggggtc agagggtgga aaaggcatca aatgccaaag tgagggggct 
14221 caaggcaacc aaacagtact tggacacctc aaaatgacta gaatggaacc tgcatgcaat 
14281 agagggcacc taaatggccc tgggaaccag acgggggaga cccatggcct aggaaccagc 
14 341 agagtcatgg ggtagatggg gagttgcagc agggagggac aggaaagaaa gaattttcca 
14401 actgggtgca gtggcttatg cctgtaatcc caacactttg ggaggccaag gcagtggatc 
14461 acctgagcac aggaattcaa gaccagcctg ggcaacatgg cgaaaactgt ctctaccaaa 
14521 aaaaaaaaga aaaagaaaaa gaaaaaaagc gcatacacac acacacacac acacacacac 
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14581 acacacacac acacacacac acacacacac acaaattagc caggcatggt agcatgtgcc 
14641 tgtagttcca gctactcagg aggctgaggt gggaggatca actgatcctg ggaggttgag 
14701 gttgcagtga gctgtgactg ccccagtgga ctccagcctg ggcaacagaa tgaaattgtc 
14761 tcaaaaaaaa gaaagaaaag aaaagaaaga gagaaaggag agagggaggg agggagggag 
14 821 agaaggaagg aagggaggga gggaaggaag gaaggaagga actaatgaag gaaggaagga 
14881 gagagagaag gggaagggaa ggaaaagcag aaggtaagga aaaggggaag ggaagaaaaa 
14 941 ggggaaggga aggaaaaagg gaagggaagg gaaggaaacg gggaagggag ggaaaaggaa 
15001 gggaaggaaa aagggaaggg aagaaaaaga gaaaggaggg agggagggaa ggcaggaaaa 
15061 ggggaaggga aggagaaggg gaagggaagc acagggaagg aaaaggggaa gggaagggaa 
15121 aagggaaggt aagggaaggg aaaagggaag ggaagggctc ttgtttggaa gaccagatgt 
15181 cttggtgacc agatgtgtgc aatgagggag tggaactggc aaaggcaaag ctgaggcttc 
15241 caggctgggt gccttggagc atcagaaata ggaacaatat ggaaagaggc tgcttttcag 
15301 gagagctcct gaaggttcca tggagcatga agtaatggtg gaacttttca gtggaaatgc 
15361 cccagcaggc agctgggaag tggaggagtg tctctgggca gcatccatgt ggagggtgag 
15421 gggagagtga ggaccaggag ccaggaaagg agagtgcagt gccttcactg agggtggagg 
15481 aagcagaggg catcacccaa'gggagaaaga ggagggtaac tgagagctgt tgggaagaac 
15541 ccaggagtct ttgacggaag cccaagggtg agggactcct gaaggagagg gcagtgccga 
15601 atgtcacgaa tccacttptc acgtgggccc ttccaaggat ctgtagagag tgtgcactca 
15661 ggaaatattt gttaactgtc aaaaaagcat ttcatcagca atttcccctc ccatacagat 
15721 attttgctga tgacaaaaat aagaggagtc attggaggta aagacagagg tgagagagat 
15781 cagctaccag agatactgct ttcgttgctc ttattgtcaa gatgaatgct caagccccac 
15841 caacctcaac ttaaagcaaa cagttccagg gaatgcttct gaaaaacatt tttgtctttt 
15901 tttttattta atgtaagtct ggctccctta attatggttt ttgagccagt agaacgatta 
15961 aggggaggaa gagcggggtc agtgccagca acttgccaaa gagaagtttc caggaggatg 
16021 agtagagaaa aatcacctct tttctcctcc aaattttctg tacataaaat acgtgataca 
16081 aaaatattca cataagtgct acagtataaa tgacatttga gtagtgacac agaatatagg 
16141 aagggtgaca gaagggaggc tgcagcatgg aataggtctt atggcgtgca cttggaagac 
16201 gtaatgcaag gccaggcgtg gtggatcaca cctgtaatcc cagcactttg gaaggccaag 
16261 atgggtggat cacttgaggt caggagtttg agattaccct ggccaacatg gtaaaaccct 
16321 atctgtacta aaaatacaaa aaaattagcc gggcgtggtg gtgcacgcct gtaatcccag 
16381 ctactcggga ggcagagaca ggagaatcac ttgaacccag gaggcggagg ttgcagtgag 
16441 ccaagatcac gtgatcgcac tccagcctga gtgacagagt gagacttcat ctcaaaaaaa 
16501 aaaaaaaaaa agaaaagaaa aaagaaaaag aagaagatgt aatgcacatg gataacagaa 
16561 agagtgcgga gggcatttct ggaaggtgtg gtcatacacc cgagacaggg aaacaggagg 
16621 aagtgaacaa aacaaaaaat aatacacggt ttatttgatt gaagtggaaa agctattccg 
16681 cttgtgaagc aagatgggat tgggaagggc acctgggaag agggcttttg cttggagtca 
16741 ttgggaaaca ggctgcctta atacgggttg ccctgaaggg ggagcctgtg acaagacttg 
16801 cctgctggtg gtgtatttag ggtgatccct agaagcagga gagaggaagc aggaagagtg 
16861 cgacaggcaa ggaggaaagg ccagtattag ggtgtgttaa tgagactggt gctatatgca 
16921 gggggcctga ttcccctggg acctcctggg aagatgctga atgcccccca gaattgacaa 
16981 cctgaagctc aggtagctgg agtatttttc caagtctcca gcaaagaccg ctcctacatg 
17041 ccttacccct acttctaggc tacccacctc ccatgctgag agggctcctt ggaacaaaag 
17101 tgctcacttg tgatggaaag ctgtcagcac aaagttgatc tgagcttcca gggagctctc 
17161 ccagcaccta tagcttctgc acctcccggc tgctgcagtg cacatggttt caagctctgc 
17221 ccctgctgtg ccaggactcc ttcaggacct ggtgcctaca tcattccaca gccagcagcc 
17281 tctcctggca ctatccctct actccaagct gttgtgtggt ggagcaactc taccaaaact 
17341 ctgcttcccc ataaacagct ttccccagaa ccccagaggt cagattacca gtaggttcca 
17401 tgggcatgac gccatagaaa cttctttccc ctccagtgag acccagctgt gtcctctcca 
17461 acaaggtctg gctctgagca ggagcagagg ggagactaat tccttgggtg ctcaatctca 
17521 tttttggact ttgcacactt cccacatgaa gtgaagaatc tcagtcttaa actgttcact 
17581 tttcaaagtg ttcactccta caagtgttga gggaagagac ttaatacatc aaacaatcat 
17641 attaagtttc cgtttgaact tctagagccc cttgggcagg caaacttgct aggcctacgg 
17701 aaaaaagaat caaacccctt ggcaaggggt ggcaagccct caaaatctgg cc ctgactgg 
17761 cagataagtc aatgagaggg ctatgcccct agccagccat ccacactctg ggccttctgc 
17821 agctctatta niuinnncnnn gnnggggnnn gggggggggc gggcgggccn ngccccgcgg 
17881 gggcggcggc gggcgcggcg gcgggcgcgc gggcccccgc cccccgcgcg gggcggggcg 
17941 pSgUgpg IUgScS Sggggccggc ccgcgngccg gccccgcccc gccggccccg 
18001 gccccgcggg gggggccccc gggccgcggg gccccggggc gggcggcccc ccgcgcgggg 
18061 gccccgcgcg ncggggcccc ggggggccgg cggcgggggg ccgcgggcgc cccgcggggg 
18121 qqcqqcgggg cgcccggggg ggcccgccgg gccgggggcc ngggggggcc ccgcccgccg 
18181 tlcUcpSt ttcccllttt Icccgcttcc cgtttccccc cttgttcccc cttttccccg 
18241 ccccgccgtt tcccgggcnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 
18301 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnta 
18361 tgatacgaat tcgatctcgg taccctctta gatgccagcc cctccgtctt gcactttcca 
18421 gcctctagaa ttataagaaa taaatttctg ttattctttt ttaattactc agtctcaggt 
18481 attctgtcat agcatcagaa tacacactaa gatacctgcc taggccatga ggaccccaag 
18541 acacagcaca aattttgtac aaactctaac cagaccacct ttttcctact cccagggtga 
18601 ggtccttcct tcccctgtcc cttgcagccc tatgggcttt tttgctttct ctcattctcc 
18661 gggccttctg cagctctatt tcctgcaaga ttgcatcttt tctcctggat atttggtcgt 
18721 cgctgggctg aggtggcagc ctagtgttgg gagcaggctt ctggctggcc tctgatcagt 
18781 aagccagagg ggtcaccttg tcctgaaata caacccagcc attggccatg ttttccacca 
18841 tcttatgagt attcccagtg accaccctca gcagttcaag acaagttggg ttacacaaca 
18901 aggcagcaca ataatgaagc tcccacccct ggcttgggcc tgggcgttag aaagccctgc 
18961 ctgcgggggg ccctctcaca tgcagaaggc aatgggtagt agcttcacca gacgatcctc 
19021 tgcctacctg ctccaggtct ccccctgggg ccatcccacc tcttcctcat taaagcctgt 
19081 gctgaaagga gcggggttgt ttttcaaatt ctttcagcag tgagttattg atcaacactg 
19141 atagtgcgag cagcttgatc tcaccctgaa ctggcagaga ttaattagct tga^tcccaa 
19201 gcataattcc taattaggca caggaagaaa ggaaagaaaa cacaaaggaa ctattcttct 
19261 ccaggaacca ccgaccttct gtccaggggt ggaggaggaa gaaacaggta agcatgtccc 
19321 agcaccagag tgcttcctgc ccattccaaa gctgggcagc cagcccgtca gcacaaacaa 
19381 caccgaggac ggtgccagtg ccatcctgca agggtctcag aagaaaatgt gcatttaatt 
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19441 tgtttgcttt actctctgcc taatgatggg cagggaaggc acaattaagc aggaggcttt 
19501 gaagtacctg ctctaggctt gcttagccac cagaggctga ggatgcaagc aggggagagt 
19561 tcagctccaa taggctgggg aagaaaggtt ggatggggag ggtgtagtgg ccctgagact 
19621 tgagcagaaa tagtgaaagg aaccacttcc tgcagctgag aggcaggagc catcatgttc 
19681 tgttgtgggc attttctaga atgttccagc caggaccagc agctcaaatg gctcttccct 
19741 ggtggtgggt agggcaatag gagataggag ggaccttcaa tgtggcaact gtgaggtttg 
19801 gccaatcaac ttacccatgc tgagcttcgg tttctcttca gtgaaatggg gataattaat 
19861 gatcgtatca gcttacatga cggttgtgag aattaactga agaattcaga gatgcatggc 
19921 acataggagg cacttcctaa cccacctaaa gaaagcaaac ccacagggca gcaaatcaga 
19981 aatggatgag ggctgacctc ctctttattc agtagcacat gtctaataga cccgactgtg 
20041 tctcgcacat gctgcccctg aggaagactt attataggga gctacaaaca gtgtcacctg 
20101 ggaaagacag tgaactcagt tatagaacaa gaacacctgc ccgggctgac tgggggccag 
20161 gatgcaggga aggctgcctg gaggaagtga catacccctg agtctcaatt ggccagggaa 
20221 acaactgagc aatgatttgt gaaaatgagt tcagtagagg ttaataaaaa ttgggagtcc 
20281 tggccagtgg ctcacgcctg taatcccagc acttcgggag cccaaggcag gcggatcaca 
20341 aggtcaggag ttcgagacca gcctgatcaa catgatgaaa ccctgtctct actaaaaata 
20401 caaaaattag ccaggcatgg tggtgcctgc ctgtaatccc agctactcag gaggctgagg 
20461 caggagaatc gcttgaaccc aggaggcgga ggttgcagtg agctgagatc atgccgctgc 
20521 actccagcct gggcaacaga gcaagactcc gtctaaaaaa aaaaaaaaaa agggaatccc 
20581 tgctgtctgc agtaggccaa tcaagtgtcc aagaattgtg agaatgtgct tccccatcac 
20641 caccaccagc ccacacccac accaaaatgt gtcacccttc aatcccttca agtctgtctg 
20701 ctcagcatcc tgatccaaac aagccacttt aaaaagtgca gttatgaaac aattcgaaca 
20761 ctgactggac tactgtttat atttttagat gtgatactgt ggttatgttt ctctaaaagt 
20821 ccttgtctgt tagagataca cactgaagtg tttacagaga gaatgataca atcgctgtct 
20881 ggaatccgct tcaataccat ccattgttgg agggtgggta gtggggttag ggggttttga 
20941 ggaaacaaga ttggcccggt ctatcagcat tgatgctggg tgagataggt aaatgggggt 
21001 tcattataca ctttactttt gaaaatgaag tttttcgtaa taaaaagttg aaagaaagaa 
21061 ggaagggaga tgggaaggca gggaaaggag aaagggaagg agagaaggaa aagattctgc 
21121 tcaaatgtcc ctttggaaac cttccttgac ccccgcgttg gtgtggggcc gggggaatgg 
21181 ggaggctgtt ttctgctcag ccagagcaac aaggcccatg aggagacagg agggatgtct 
21241 ggggagcagt cccaagccac agagtccttc tggggaccag ccacaccatg agagagagca 
21301 gaacagggag catagcacag acctatgaca tagcactgtg ccatgacatc agctggcaaa 
21361 ggcccatggg accaggaggg aggcattaga agggcgaaca ctagcagctg ttgagtagga 
21421 agaaatgtga gccgacttct ctacctaagg aagcggtggc aaggccgtgg aggttcggga 
21481 tggggacggg ttccaggatt a^agtgcttt tttttttctt tctttctttt gaaggaggag 
21541 aggctatggg agcctcaggt accgagcatc tggtgtctgg agagatgcag ctactcaagg 
21601 cagtttctga tctggaacca ggaagtctct gtttccccac cagccagcaa acgttctgga 
21661 cttggtgctg ggcctgccac caccaaaggt ggagatgccc cacatacctg cgtcaggcat 
21721 gacaggtcac ccctgagtcc ttgggtctcc ccaaggcttc tccagggctt cccaacaggg 
21781 ctgagattca agaaaacgct caaggtccac cccactgctc cgaccctctc accagaagaa 
21841 aagggctgtg tcccaagggc agcctgcttg ggaaaagaat atgtggggtg acttgtccag 
21901 atctgcccag ggaggctggg aatggagaga gaaaggaatc tgcccaccag ttgggggaaa 
21961 agtggatttc gcaacaacct tcatgatatt aaatgcttgg ggggcctggg aagaccaaat 
22021 gtttaatatt tggagaagag gctgtctaaa cagagtaatt aggaggctcc tttgtagtga 
22081 tatgaaagag atgaattggg tatatttgac atcaaagaca ggagatccag tggggaatga 
22141 gacccacggc cgggcgtggg cagcgctgga ggacaggctc tcccagtgtg ctgcggggga 
22201 gggcagcccc tgcagtgtgc gtgttgagtg ctgggaaacc cccaagtaac acttcccccc 
22261 tcatttaaca cacttcctaa aacctctcct ccccaaaagc cactttgcaa aagcagcgat 
22321 taccaccaac cacgagaaac acaggagcct gccttccact gtttaccact gggcaggaag 
22381 tcgtttatgg accagtctct tgcccaggct aaactcagaa gtggacactg agggccggtt 
22441 gtgggtgcca ccaggcaagc tagggcatcg gagttcagca gagctgggtg ggaattctgc 
22501 ttctccctcc actacctggg taaccctggg caagtcactt agcctctccg agcataaact 
22561 ggtttgcttg taatatgcaa ataataaggg gtgggggaaa tttttttttt ttttttttga 
22621 gacagagtct cgctctgtca cccaggctgg agtgcagtgg cgtgatctcg gctcattgca 
22681 acctccgcct cctgggttca agcaattctc ctgcctcagc ctcctgagta gctgggatta 
22741 caggcgcccg ccaccactcc cagctaattt ttgtattttt agtagggacg aggtttcacc 
22801 atgttggtca ggctggtctc aaactcctga cctcgtgatc tgccctcctt ggccttccaa 
22861 agtgctggga ttacaggcgt gatggagatg atactcccta aatcacaagg gtgtggtgtg 
22921 aagatgaaat ggctgcacgt gacagaggag ttggagaaca ggaagcaagt ggattacttg 
22981 gatgaactgt gatgggccca agaagaacta agccaatcac tccagggggc gttttcccag 
23041 gatccaggca gagagctgat tccagcatcc acgttgtatc cccctcccgc tgtggccggg 
23101 gtctccaaga gccgggcacc cactgcagag cttccaagcc caggcccatt tgaacacagg 
23161 aggacttttg aggttcattt aaccccaaaa tcagtgattc tacatctcaa gctcatattt 
23221 tgaagtgttt acgggtaagt catatgatac tggtcattta ttttttaaat attctagcca 
23281 aaagtacatg gtagacaaag tgagaatgtc agaatgttga taattagtga aactagacga 
23341 agatacatga aagtttaatt ttgattctca cctactttca tgtgcattag gaaactttca 
23401 taataagtta aagtaacaaa ataggccaga ctcggctgct catgcctgca atcccagcac 
23461 tttgggcggc taaggcaggc ggattacttg aggtcacgag tttgaaacca gcctggccaa 
23521 cgtggtgaaa ccccatctcc cctagaaaat aaaacaaaaa aaaaaaattg tagctgggca 
23581 tggtggcagg cccctgtaat cccacctact caggagggtg agggaggaga atcgcttgag 
23641 cctcagaggc agaggttgca gtgagccaag attgcgccat tgcactccag cctgggtgag 
23701 agagcgagac tccatctctg aataaataaa gtaacaaaat acatttcagt ctcgagctcc 
23761 agttccctgc caggaagagc tgaagctgag ctctggcacc ccctgcttct tctccctcat 
23821 ccctaaaaca tgtggggtac tgggtgccaa aggagatctg aagatggcca ccactcagac 
23881 cagcagtaga ggccaagaag gtggtgagaa attttaaggt attgtagcat atgctctcca 
23941 gggtgaggac acaggatgct gggacctaag gagggagaag ttaacccaga gagggcccaa 
24001 gtcagggagg acctgactgt ggtgacagag ccagtcatgc taataatagg agcagtggtt 
24061 ttgcaatcac aaatattggt ctgggggact gcgggcccaa tgtggtggga gttttagagc 
24121 agggacccct ccactgtacg tggattctta tacttgctaa gttgcactga attctcaatc 
24181 aacccatttg gaggctctca ccaagacagg agtccctttg aattggtagg atgttaaaag 
24241 cttcctccag ccattgaccc tgggtcccat tggtagaaat cagataataa atcagatttt 
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24 301 ccaggtaatc ctcagtttaa tgtccattcc ctcattcaat caattattca attacaaatc 
24361 atttttaata tgattttatg gttcaggaat tcaggcaggg ctcagatggg cagttcttct 
24421 gccccacatg gcattggctg ataacaggct accccaagga aatcacagtg taatgggaga 
24481 tgggaaatag gttaacacaa agttactata ctacaagaaa aatcttcaaa tcttcaatct 
24541 tcattcttca tgggtgtgag tcttctctgg tggagaagag agttctttgc taaattatat 
24601 tcctgtgtta gccagccaag cacacacgga gtacagctgt cccttgctat tcatggcaga 
24661 ttgatcccag acactgccac ccccagccct gtgaacacca aaacccatgg atgctcaagt 
24721 ccttgatata aaatggtatg gtatttgcac ataacctaca cacatcttcc catatacttt 
24781 aaatcatcac tagattactt ataataccta atgcaatgta aatgctatgt aagtagttgc 
24841 tacgctgtat tgcttaggga ataatgacaa gaaaaaaagg ctgtacaact tcagtacaaa 
24901 tgctatcatt cattttttta tctgaatatt atgatccatg aatacagagg gctggctgta 
24961 aaaagaatgt acagtttggt actttactcc tagtttaagt catctctttc ggcccataca 
25021 ggttgcctgg ggcagggcgg tggtgagtgt gttgagtcgg ccctataccc tgttccacaa 
25081 tgcctggtgt ccttggcacg tagcttgcag ggctgcttcc atggatggct gcctcatccc 
25141 ccacccctcc ttcatggcac cctcttttct ccaactcctt ttcctctgct gagtcctacc 
25201 ttttcttgcc agaccaggga tttgccaatt attctggatg ggcacaggga aggggccctc 
25261 agatcctgca gagggagcat ctgaaagaca atgagaccct gtacacaacc aagctgagcc 
25321 tcctgtaagc acatcctctg ggtgtcacta tgcgaaaaga ctctttgtct gtttacattg 
25381 cacaacgaat tgtccctctg attgtcctag cgttggtgca tgcatggatc caggaagagc 
25441 aggtatttgg acccaacgag acaggttcat cttggcagtt tgtctcctgt ggttgctttg 
25501 tgcctgcccc gtatctatcc acccaacttc tggcagcaat ccctccagga cccacctctg 
25561 tcttcatggt ttgtgtgttc tgatcctgct ccttggcccc agaatacacc ctccacatct 
25621 aagctatgca gtgcctcatc cccacagctg cagcgattgg tttaggaatg ggaaaataat 
25681 ccaatcaaag taagattgtc taatatctga tcccgcttac cccggagcca gaagtcccac 
25741 ctttccaggc tcctccatgt tcctttctgg agcaatgagg aggaaggttc attcccttct 
25801 cagggaactt gctgcggtgt cccagtcctg actgtcccct ctctacccct gcctgtggct 
25861 ctcccgtctc ctggcattct aacctgccac tgaattcttc ccacgccaaa cgcaggttag 
25921 gaacctaagc tcggatgaaa tgctggctta gacctcattg ctactgagaa gcagaaacca 
25981 cactttttag ttggtagagg tggatgtgga ttggggccat aaagtttcct caatcattga 
26041 ataaggattt cattatttgg actaactaaa tacattagat tatttatact aactaaataa 
26101 attagattta tttttcgaag tttaaaatca ttttcaatag tgacgaaaaa ggctgtagat 
26161 gagatgggag aatctttcga cgccaaattc ttcccatctc ctgcctagta tgtatcacta 
26221 caccttgccc agagagactg taggggtgct cagcaaaaac catggctggt tgattccctg 
26281 gaaaaagggg tgctgtccca gggagctctg tgcccttcct gccaacactg gggtctagag 
26341 gaggcaagtt cccctctgaa ggtgagtatg attcactcat taaaactaag aaatttacat 
26401 tggtatgtta ctgttaacta acctacagac tttttttaga tttcaccgtt ctttacactc 
26461 tcgtcctttc caatttccgg gatccaatgc aggataccag gttgcatttt tggatgtgtc 
26521 cagtctgtga tcatttctca atctttcctc attttctcat gacttttgac atttatttat 
26581 ttatttattg tgtgagactg agtctcactc tgttgcctag gctggagtgc agtggcccca 
26641 tcctggctca ctacaatctc cacctcccgg gttcaaatga ttctcctgcc tcagcctccc 
26701 aagcagctgg gattacaggc gcacaccacc acgcctggct aatttgtatt tttagtagag 
26761 atggggtttc accatgttag ccaggctggt ctcaaactcc tgacttcagg tgatctgcct 
26821 gcctcggcct cccaaagtgc taggattaca ggcgcaggcc accacgccca gcctgacttc 
26881 tgacattaga gtacagggag ggattctgta gaatgtccct caatttgggt gtgtctggtg 
26941 ttttgccatg agactggggt tatgcttctg ggtgccacac agaggtgaaa tgtccttctc 
27001 atcacattat tcgagaggta catggtacca atgggactta tcgctgctga tgttaacctt 
27061 gatctcttgg ttcaggtggt gcctgccagc tgtctccact gtggagttac tatttttcct 
27121 tttccccatt ttattcatca gaagccagtc actaagcgag gtcaaactcc aggacagggg 
27181 aattaagtgc caccttctgg agagggagca ttcacattta ttacttggga tccttctgta 
27241 aggaagagct gtttctcctc taaaaaactc tttaatcctt ttaagcctca atttcttaat 
27301 tgtgaaatgg ggctaatacc tgtatccaac caagggagta gttagaaggt aacatgatag 
27361 gtggaaagca cttaacatag gcaaaatgtt atgatcagga atgatcgaga gacccatcca 
27421 actatctgaa ggagtcactt aactctactg tactgcagcg ctgtaaagtc tgcatctttc 
27481 actgggggta aaggccccca gtccctgaga cgggccagtt tggagacagg ctggtttttt 
27541 ctctgttctc ctgagagccc ttcagatgag aagggaggtc tggagacaga atgccaaaag 
27601 cccattaaag gcacggcctt gcatttcaga gagggagcag gtctagagaa gaaccagagg 
27661 agctcagctg agatatggtg tatggattgg attttggtag aagatgggaa gaaccaaaca 
27721 cctgagaaac cactttgaag atcggggtca gagtaaggcc taacacatag ttggctccca 
27781 gtaattattg gttgattgaa cagctcaaag agcaactcga ccaagaacac tggactggga 
27841 gtccagttac ttggatcttg cattcctgat ttatttttat tttatatgta ttttttctat 
27901 ttttttgaga cgaagtctca ctcactctgt cgcccaggct ggactacaat ggcacgatct 
27961 cggctcactg caaactctgc ctcccaggtt caagcgattc tcctgcctca gcctctcgag 
28021 tagctaggat tacaggcatg caccaccacg ctggctaatt tttgtatttt taggagagac 
28081 ggggttttgc catgttggcc atgctggtgt ccacctcctg acctcagttg atcttcctgc 
28141 ctcagccttc caaaatgttg ggattacagg cgtgagccac cgtgcctggc cgtgatttat 
28201 tttttttgtg tatgtttgtt tttgtcaact tgctgtgtga ccttaagcaa gttacttaac 
28261 ttctctgggc ttcactttcc atggatgaac attgtaaaga ggctggagag agatgaggac 
28321 taggtacagg ctttagagga gagccaccgc cccggacttc tccctctgtc accccgcttt 
28381 ccatgaccct ccttgcctga ctttgtgact ccttgcctcg ctatcaaaac aagtgctgca 
28441 atctcagtgc tttccaagag ccctgcattg ttagaaactt cccagcacgc agcaaaggct 
28501 gctgcaatac tcgctctgcc tgcctttgcc ctgcgcttcc tacttaccct ccttttgttt 
28561 ctcccaaaca tctgtccctg actatgctca tctcatgttt gtcctcagct gctgaaaggg 
28621 ccacgtttgc tttcattaca aataagacca ccgagtgggc tcctggcgtg ggggcgggag 
28681 cagccgcgcg cagtcttcag aggcagcccc ccaggctgtc tctggagggt gtgtctctgc 
28741 ttccctttcc ccgtgtttat tttcagacga agccaagtgg cccgggggga ccctccggac 
28801 tcccagcctt cagagaggag ggcagctcgg gctttcgccg cagtgcttcc tgcccgtcac 
28861 gtgtgtgctc ctagccgggg tcgggggagc tggtatcttg gcccttctgg gaggacgcgc 
28921 acagcccgag gaggcagagc cccagacggg aatgggcttt tcagaggtgg ggtgcgggcg 
28981 aggggacgat gcattatttt taatatttga tttatttttc caactggact tcttcccggg 
29041 gctctttctg ggcccagctg cctttgtgat cccgcgcccc ggtcctcggc ctctcacctc 
29101 cagcgccggg gcgccccctg ctgtcggaag cggctgtgac cgggcagagg tgctatctgg 
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29161 gactctgggt tctcagcccg gggacagcga accgaggggc agatgatcca tcagaaaaga 
29221 gccggcactg cccagccccg cgcccctgcc cctgcctttt tccgggagcg cgccgcgccg 
29281 cacccgctac ggccgcttga ccccatcttt gagcccggcc ccaagctctg ggaccgtcgt 
29341 gcccctcatc aaggaagagc caaggacccc aaggagaagg tcaggagcgg cggtgtggat 
29401 gtcccttggc tgcaggcccc gccgcgcact cccttcagtc cttcccttct ctagggacca 
29461 ggtagcatca gtgcctggat ctcggccttg tgtgccctgc tccctgcccc acctactaag 
29521 aaccaagtct ggttcaccgg ctcccaagag ctggaaccca ttctcagcta gctgggggcc 
29581 caggccaccc cttccctcca gacctgtgtg ccttctgccc tggctccagg gccccccaca 
29641 ccgtgaccag ggcgggatcc ctatggggct ggccagtcgg caccgtgcca ggcccacagt 
29701 gccctgggcg tccatggaag tcgttctgtg tctttaaaat cagaaggaag acattaacct 
29761 ttaggctgaa gaaaatgttt tagtacacag caataactta tttgtcttta tccaacagcc 
29821 ataaaatata actttaaata ttctattgat agagaaagga gttcatgaag gcagaaatgc 
29881 ctggggccca cgaacatccc agtgtggccc tggacgggac atcatgctgg gcaacacagc 
29941 taaaatgcgg gtgaagacca gatttcttgc acatggcggt gacgggatgc tccctagaga 
30001 gcttcaagtg gattctttgc tttttatttt ctctcttaat aaaaatgtat gatgtttaca 
30061 ttgtcagaga acaaacagaa ctgtggcttg tgtttcatgt cttacgaaat gggtcttgtg 
30121 tacttttggc tgttctttca tgttcttagc agttatagat ttggcccctc tggaatggag 
30181 gggccatata tcagggggtt tggggcaggg gacacggccc tctgttcacc ctgccctccc 
30241 tgataggttg atttatctcc ctgcaatccc ttattcgcct tccgggctgg tctccttccc 
30301 agtccgagat ctgcccccaa ggctttgttt ggcaccgccc cacccctgcc ttccccagta 
30361 gtctttctac agatcacccc ttcccatgtg tcctcagcgc acccctgcct tccccaatag 
30421 tctttctaca gatcacccct tcccatgtgt cctcagcgca cccctgcctt ccccagtagt 
30481 ctttctacag atcacccctt cccatgtgtc ctcagcgcac ccctgccttc cccaatagtc 
30541 tttctacaga tcaccccttc ccatgtgtcc tcaacgctgg ccttctcagg agttctagca 
30601 cccgtctcca ctgctttagc ctgctaagac tgccataaca aaccaccaca gcttggggga 
30661 ttcaaccaca gaaatgtgta tgctcataat tctggagacc agaagtctga gatcaaggtg 
30721 tccacagtgc tgatttcttc tgaggcccgt ctccttggtt tgtcaagagc ctcttctccc 
30781 tgtgtcttca cagggtcttc cctctacgcc tgtcagtgtc ctgatctcct ctcagtcaca 
30841 ttgggttagg gatcaccata tgaccttgtt taaccttact cgtgtcttta aagactctat 
30901 cttttttttt tttttttttt tttttttttt ttgagacagg gtctcactat gttgcctggg 
30961 ctggagtgca gtggcgccat ctcagctcac tgcaagcctc tgcctctggg gctcaagcaa 
31021 ttgtcctgcc tcagcctccc gagtagctgg gattacaggc gtgcgccacc acccctggct 
31081 aattttttgt atttttagga gagacggcgt ttcaccatgt tggccacgct ggtcttgaac 
31141 tcctgacctc aagtgatcca cccgcctcgg cctcccaaag ttcggagatt acaggtgtga 
31201 gccaccacgc ccagccgtct ccaaatacaa tcacattgtg cattactaca gttaggattt 
31261 caacatgtga atttgagggg gactcaaatc agtccataca cccaccacca cctccacttc 
31321 cattcacaga aatgattgcc agaactacag accttgagct tgtgtggggt gacaatccga 
31381 caagcaacag tgagacagag ctgagagggc tgcagaccag gatggtggga ccgaggttgc 
31441 taacaaagcg ctgctgagac ctgcacagca tggaggtgct gctgggggag aggggtcccc 
31501 gtgagtggat ctgcttggtc tgcccttgca aatagttctt gtcaacggga aggacttaga 
31561 gccccttgat ctacactcat ggatactttt ttgaactcaa ggtttacagt agaaaccagc 
31621 agcaccccat cccctagggc tctctaaccc actgtgtctc taccctgacc ttacctacct 
31681 tttttttttt ttttttttgt gagacagagt ctcgctccat cgcccaggct ggagtgcagt 
31741 ggtgcgttnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 
31801 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnntt ttccagaatg 
31861 tcatctagtt ggaatcatac attccgtagc cttttcagat gggcttattt cacttgacaa 
31921 taagtttcct ccacgtcttg tcatggcttg gtagctcctc tttagtgctg aacaatattt 
31981 cgttgcctgg gtgtaccaca gtttatatat ccagtcacct acatcttggt tgcacaagga 
32041 catctcggtt gcttccaagt ttcagcaatc atgaataaag ttgctataaa cattcatggg 
32101 cgggtttttg catggatata agtttttaac ccatttgagt aaataccaag gagtgtgatt 
32161 gctggatggg atggtaggag tatgcttagt tttgtgagat acttcccaac tgtcttccac 
32221 agtggctgca ttctcatcag caatgaatga gagctcctgt tgcctcacat cctcaccagc 
32281 actggatgtt gttggtgttc tggattttgg ctgttctcat aggtgtgcag tggcattggt 
32341 gttttaattt gcatttccct gtaacatatg atgtggagca tcttttcatg tttattttgc 
32401 atctgtataa cttctttggt gaggcgtcta taaaggtctc tggtccactg taaaatcacg 
32461 ttgtctgttt tctcattgtt gagtttttgt ttttgttttg agatggggtc ttgctctgtc 
32521 acccaggctg gagtgcagtg gcataatcac agctcgctgc agacttgacc tcctgagctc 
32581 aagctatcct cccacctcag cctctcaagt aactgggact acaggcatgc gccaccatgc 
32641 cttgctaatt ttttattttt tgtagagatg aggtctcctt ctgttgcccg ggctggattc 
32701 ttgagttgta agttctctgt atattttgga actacggtgt atattactat actggtggac 
32761 acatattaga aatgggtgca aacccataga ctacgcaaca ccaagagtga accctaatgt 
32821 aaactgagtg acagtggact ctgagtgatt ataacgtgtc agtgtaggtt caccatttgc 
32881 aacaaatgta ccgctctggt gggagtgtta ataatgaggg ctgcgaagca tgcatgggct 
32941 cagggaggat gtggggaatc tccgtatttt ctgctcaatt ttactgcgaa tttaaaactg 
33001 ctctaaaaaa taaactgtat taaaacagaa aaaaataatt taggattaat caacgtgtaa 
33061 gttcctggtc attgaagtca gaacctcacc tcatttggaa gcaggaaatc ccttctgtac 
33121 tcaccgtgtg catgggacac actgaccaaa agcccctgcc cattggcctt cctgggcctg 
33181 aagtggcact tcatgcttgc cctccacttg cctgtcgcca catgcatgcc agggtggcct 
33241 cagccctgcc ttaccctttg caggcccaag tcagcggggt catccccaga aaggacttgg 
33301 tgtcatttct ggaacaagtc atttctgggg gtgcccctgc tgtcacttcc ctctgctcca 
33361 gggaaccctc tcctaggccc tgccagggtg tgggacttct tgctgacccc agtccttggc 
33421 tcatctccct ctttcttctc ccagggtcgt tgaagaggga gcctattcct tccaagcctt 
33481 aaagccaccc tccttggctt taagttacag ccaacattct ggcagtcgta ggatccacca 
33541 gcagaagcta tgttctcctg cttggtctat gggtgaatct ccgcagggcc ctctgttccc 
33601 cactcggtcc ccactgtgag gccccgcctg gcccctctag gttctctgcc cccaactggt 
33661 cctgcccact tctgctctct ccttcccttc tcctcaatca tcacgccctt gttctcaggc 
33721 ctcactgccc cagcacttag cacagaggat caggaacttc cacagtgatc ctaaatctcc 
33781 ctgttctttt taaatttact ttatttttta gagcagtgtt gggttcacag tagtttttgc 
33841 ttccgtcaaa aaatgcttgt tgactcggaa cctcttcctt ggaaatttca actgcaaatc 
33901 tgagagctgc tttagagaac cctctgtggg agagtgcctg ccacctgttg ctccagtttt 
33961 ctgtacataa tctatgatcc tcctggcaat cccatgatgt gggcattgtt attcctgcgt 
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34 021 tataatcgag gatgttcagg acagagccta tctaggtggg aagtggcatg gccaggatat 
34 081 gccccttggg tgccctagga ccagctctgt tcccatctct gagctccctc tgcccatcgg 
34141 gtaaggtctg gtcacttcca tactgttttc aatctactct gtgccaggca gggggtcaat 
34201 cactgagatt tgtaaggtac atgggtgggg ctggaggtta gaagagagtt ttcaaatcac 
34261 tttcagccta gtggggaagg caggcccatc accaaggcag tccaccaaca aggctagagg 
34321 gagctgtgct tagcctgagg gaagagcccc atttctgtgg cattatgggt gacatttgaa 
34381 caactaacca ctcatttcat tacccagggt caaacttcgc cccagggata ttcattagta 
34441 agatgaagag gcttactgtt tgctgtggtc ctaatgtctg tgtcccccaa aattcgtatg 
34501 ttgaaatctt aaccctcaag tcgatgatat taggaggtag ggttttgggg agaagattgg 
34561 gccataagga gtgaagttcc atgaatggga ttaatgccct tatgaaacag gccccagaga 
34621 gaaccatcgt ttcttttgcc atgtgaggtt acagtgagaa gtcgctggct aggcgttgat 
34681 ggctcttgcc tgtaatccca gcactttggg aggccaaggt gggaggatca cttgaggtca 
34741 ggagttcgag accatcctgg gcaacatagg gaaaccacgt ctctacaaaa aacacaaaaa 
34801 ttaggccagg cacaatggct cacgcctgtg atcccagcat tttgggaggc tgaggcaggc 
34861 agatcacttg aggtcaggag ttcaagacca gcctggccaa cacagtgaaa tcccaactcc 
34921 actaaaaata caaaaactag ctggatgtgg tggtgcgcac ttgtagtccc agctacctgg 
34981 gaggctgaga caggagaatt gcttgaacct gggaagcgga ggttgctggg agccgagatc 
35041 gtgccactgc actccagcct ggacaacagc atgagactcc atctttaaaa aaaaattagc 
35101 ctggtgtggt ggtgtgtgcc tgtaggtcca gctactcaag aggctgggag gattgcttga 
35161 gcctaggagt ttgaggttgc agtgaactat gaccgcacca ctgcactcca gcctgggcaa 
35221 caaagaaaga ccctgtcttt aaaataaaat aataataata ataataataa aagaaaggcc 
35281 atttatgacc cagaaatcag gcccttacta gataccagat ctaccagcac cttgatcttg 
35341 gacttcccag cctccagaac tgtgagctgt acatttctat tttttaaaag ccaaccagtt 
35401 tatggtattt ttgttatagc agcctgaatg gacacagtta aacctgactc cagaaagtaa 
35461 agcaggcccc tgtgaaggaa ggaagaggga ggaaggagga aggaagaagg gggagggagg 
35521 aaggaagagg gaaggaggga gaagggcggt ggcctcacac ccccacatct catccgcccc 
35S81 tggtgggcag cctccccaag ggattgattc cttcctcagc aacagttgct cagagttgag 
35641 cacaggaact tgcctttttc caagtagcag acacaggccg aggcacagag tggggctgag 
35701 gaaatgtgcc gtgaatggat ggtgaacagg aagggacact gaagaaggag gcggtactga 
35761 gagcagccat cttggagctt ggaaacggag acgcatgtga agccgtttgc acagtatttt 
35821 tcccttaaaa tatgggtggg gaacaagggg ctgtcatttg tctttttctt tgggatggtg 
35881 agcagtgagc agggcacgcc gcgaggagca aactgcagcc caagcacacg cggtcagcac 
35941 gcggagcaca ggtggggaga agcggctgac ggtggcgtgg ccccgcgtac ctgggtgtgg 
36001 acgccccgcc gccccgcagg ggaaagctcc tggagtctgg ggtctgctgc cccgagctct 
36061 gggtgtccag gtaggaaaca ggtagagagt atccgggttt cctacccctc cgccggggtg 
36121 gctgtgcctc cccatcctcc tgctcccctc cttccctccc tccgcagggg agggcaaacc 
36181 caacccccag gtgagggtct gttctgcggg agtgggaaac agctggcaca ggctcaggcg 
36241 gcaggtggtg gccctgctcc tgctgtgaga gatggaagcc aggaggcggc gggcgggccg 
36301 tgctcaggag acccatgccc cacccagcct gagactctgt cattcccttc cactgggctc 
36361 cccatgagaa gccccaaagc aaagtgggtg tggctggtca gggtgagtag ggcaggccgg 
36421 aacggacccg gacccagaac ggaggagggt ggtgtggacc aggtgggccg cggaaggcac 
36481 ctagcctaac aggagaggga caggcgcacc cccagacggt gggaaagcgc cctctgcctc 
36541 cagctgggct cggcccagac cctccacttc ccagcccctt gcttctctgt gccttggagg 
36601 tttcagggtg ggctggtggg tgggaggaca tgtgagagtt cctgtgtcag gagttgaagc 
36661 ccatcttggc cacttgcctg ccggggccct gtagaaaccc ctgaagctat aaacaggtgc 
36721 tcgtcaaagt tgattccagg ggagggtggc aggagcagtg gcgggtggtg tgggaagaga 
36781 aggactgctc tctagagagc tgatggcacc aggccgtcac caccgtggtg acagcacaca 
36841 cacatccaca cacacatgca cacacgcacg cacacacgtg cacacacgac acacatgcac 
36901 acacatgcaa gcacacatgc acttgcacgc acacacgtac acgcgcacac gcatgcacgc 
36961 acgcacacgc acatgcatga cacacatgca cacacacatg ccccactcct ctactgggac 
37021 cagcacagcc cctccgttcc tgctcgcccg gaaggagggc gcccagccag gtttcctccc 
37081 catgaattga gcgccagaga ccaacagtga cctgtggtgg tttttccaat tctcttccca 
37141 ttctgtcaag gacagaggag atgctttgtg atacaagggg aaaaaaagga aggtggaaag 
37201 ggaagaaagg gaagggaggg aagtcccccc cagagcagcc tggggagggg tctgggacca 
37261 ggcctaggcc caggcacttc ctgaagagat caggtgacgg cctccccggc ctgcagacct 
37321 caaggcggcc caggcctcct ctgcgctccc tacgcgttgc tctggtgctg acatccctgg 
37381 ctatgctctc catgtagatg ttccttttag cctctgcctc aactgaaatc tgaagatcag 
37441 gccaatttcg- tggtctctca gacgtcggta aacaagaggc cttggctcct caggagaccg 
37501 agagtctctc actgtacttc cttctcttgg ctccaaggac atagaacggt tgctatgggg 
37561 attccttcat ttgtaaagta agttttcaga aaaaataatg actcacaagt cctttaagca 
37621 ggaacaaagt caatccttgg ggacgatatg atctaatagg gtttaaaaga ggaactgtga 
37681 cgggctgcag ttggcgggct tttcttcttt tcctctttga tcctcacgcg tggacgaaac 
37741 ccaccagtgg ctcaggtgcc ctggcaccac catttgcttc tttgtgacat ctgaggacag 
37801 gtgttaaaaa taacttagaa atggctaact aaggcctgac acactggctc acatctgcaa 
37861 tcccagcact ttgagaggct gaggtgggag aattacttga acccaggagt ttgagaccag 
37921 cctgagtaac gtagtgaaac cttgtttcca caaataatta aaacaaaaaa actagccagg 
37981 tgtggtggtg catgcctgta gtcccagcta ctcgggaggt tgaggctgga ggatcacttg 
38041 agcccaggcg gttgaggctg cagtaagcca tgattgtgcc actgcactct agcctgggag 
38101 acagaatgag atcttctctt agaaaaaaaa agaaagaaaa aaagaaaaga aaagaaaaga 
38161 aaaggtatgt ctagctaggc taacctggag agttactgtc agagtttcca aaagggtctt 
38221 catggccact tctgggcctc gctggctggg tgaagttatt agtccaagag tgccacatgg 
38281 cctttgcaca ttcatctgta ttatgcctgg cgaggcagag ggctttccct gcattgattc 
38341 tcctggaagc ctggggttag cagtatagac tggaaagtaa ggtacacaga gtggagggac 
38401 tgaaccaggt ccccttatat gtgagtacat acagttactt aggccgggaa gccaggtctt 
38461 ctaagcagct agggccaaca gagcaggagc caggaggaat acttttatga acaaacttat 
38521 tttccaacat cagaaaaaag aaagttggct gttgggagtg acaaggagaa agaatgatga 
38581 ggagctcaaa tcaggtggac acctgattga gatgatcctt cttttgttta gaaaagagta 
38641 ccacactggc tgtcagctgt gtgaccttga gaaagtcact ctctctctca actttctcat 
38701 ctgccaggta aaagggctga gcttgaatga gcttgggtcc cttccagcaa tgatgcttgt 
38761 gtactgcacc tgtgctagga ataccagtaa ggggctctcc atgctaaaat cagcgttcca 
38821 ggattccacg tggcccacag caccaagccc agccttctca gctttgagct gaaggctccg 
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36881 ggtaacctga tcccagccca acggttctgt gttatctccc tctcaccatt tcctgtggat 
38941 tacttattgt tctatcctcc cagccgccag ggccttgttc gcacctgcta ccagcaaaga 
39001 atacccctcc taccccagtg ccctggcctg aagacgtcct gttagtcttt cagctgtcaa 
39061 tgaagatggc ccccttttca agaacccttc ctggaatcct gagtttgggg gtcgtgatct 
39121 ttccaatccc gtatccacgg tcactgcctt cacctgaatg cgattgcaat tcgctagccc 
39181 ttcaattcaa caaatattct ggaatgatta gcgtgtgctg tgacaatgac aaattccgat 
39241 acgggaagac atgaattaaa tgatcacgtt cacccacgcg tgattagaaa cagggaaggg 
39301 taaagaagga agagaaaacg gcttcctgag agcttgcctg gggtctggaa ggatgaacac 
39361 ggagctgggg gcatggaagg gagctgttcc caggagagtt gagtgggctg ggctggcctc 
39421 ttgaggctgg gagaggtaga cggtgtgtct tgatcagagg agccatctgg gaatcctttg 
39481 agggtttaaa aaggggagta acagatccaa tttttatttc aggacaatca tgctggctgt 
39541 tagtgaagaa tggcctggag cactgggtct ccgccccagc tgcctgggag aaactgtgtt 
39601 gaaggagcca taccccctgg cctcccctcc agactccaga caaccccagt cagctggctg 
39661 ggagggaggc agaagagaaa caggggtgca cctggggctc ctgcaggggt ccagccagga 
39721 gatgggggca acttggacag ggacggtgag agaggaatgg gctcgggggg catttgggag 
39781 atgaaaaggg caaggcttgg ccatgaattc ccatgggatg aaagagaagg aattgtcagg 
39841 attgagtccc aagtttctag ctcatgcaaa taaacaaacg gggcatccgc tgagctcaag 
39901 agcatgggga aggggtgttg gagactctcc cggggacccc tgcagacttc tgtaacccct 
39961 ttgagttctc gcaggcagag gccacggtgg gtggggagca ggccttggca gggtacccgc 
40021 cccccaccag gcaccctcca tcctagtagc ccttggggag acatactgtg gccagaggaa 
40081 gggtacggag aggtagagga agccaaccgg tgttcccggg cagccaccat gtggtcaaag 
40141 gaccaggcag cttgtttgga agcgtctctg aggcacaaag tgccaggacc cagtgctccc 
40201 tgagccctcc cagcaggaaa atgaaaactc agcaatgggc tcccagggag gcgggactgg 
40261 gggtgccttc catttatgag aggcccgcag agcaattcct gccgcagggg tttctgatca 
40321 gtcagaacat ctgcccacgt tgatgtcttc cagccacctg atgcagtggc ttgggcctgt 
40381 gggttggggg gagtgggatc ttaggcctgt tttttgggtt aataaatttt gtgaccttgg 
40441 atgacatgtc acttatctca cagatcctgg gaagggcagg aggaggagtg agacctggcc 
40501 tgccctctcc ctagaggctc atcagctccc tggagacaga cacgccctgg gtaaagaaca 
40561 gatcccactc agccaggcag agtttcctgg aggaggagga gggagagaaa gagggcagga 
40621 agaagagggg agggtgcaga gggggagaaa gaaaggaggg gaggaggtga agagaaaagg 
40681 gaggaggaga aaggaaaagg agaaaacctg ttttctgtct ttaactgtcc ttcccctcaa 
40741 ctgggaccct gtgctcctcc tcctcagtca gaactccccc ctcaggtccc cccacttcac 
40801 gtccctccac acttcctgtc cctcccccac tcatgtccct ctccccttat gtccctcccc 
40861 ctattgtgtc cctcctcccc ttgtgtccct cctccccttg tgtccctccc cctctcaggt 
40921 ccctccccct cctcatgtcc ctccccatct caggtccctc ccctcctcat gtccctcccc 
40981 atcacaggtc cctcccctcc tcatgtccct ccccatctca ggtccctccc cgcctcgtgt 
41041 cttcctcccc ctcaggtccc tcccaacctc ctatccctcc tcagcttgtg ttcccccacc 
41101 gctgtgtccc tcaccccctc atgtccctcc cccctcaggt ccctccccct ctcaggtccc 
41161 tcctatcctc gtgtccctcc ctccctcagg tccctccccc actcaggttc ctccctccct 
41221 cctgtccctc ccccgctcag gtccctcctc ccctcacatt cttctgtctt catttacagg 
41281 caggagaaat ttcaaagtgt gtttgtacca caattacttt aaaactctaa agttgacctt 
41341 taacggtttg tggatgtttt aattttactt ttacgtgata acctgcaata actacacaga 
41401 tgcatgcatt tttgtgctta tttgtaagtg ctttctttaa aactgctttt ttcttacatc 
41461 ctttcagttc cttt;ccctat gcctggagcc tgggtaccag ctattgaaaa ctgcttttct 
41521 ccttgttcat tcaaacatat tcacacataa aggttgtcat tcctttttgt cactatttga 
41581 caaaaattgg atcataccgt atacaattct ctgctatttg ttttgctcac ctaacaataa 
41641 atacactgtg aaaaatacct tcagatgagt agatatcaat cttgttcatt cttttttttt 
41701 tttttttttg agacagggtc tcactccctt gcccaggttg gagtgcagtg gttcagtcac 
41761 agctcactgc agcctcgcct cccaggctca atcgatcctt tcacctcagc ctctcgagta 
41821 gatggaacta caggcatgca ccactgaacc tggcgaattt tcatattttt ttgtagagat 
41881 ggggttttga catgttgcct aggctggtct taaattccta agctcaagag atccgcctac 
41941 ttcagcctcc gaaagtgctg ggattacagg cctgggccac cttgccctgc ctcttactca 
42001 tttttttatt ccaggttgca gggttttatt tcagcaacac agagacagtc aagcacagcc 
42061 atacagcagg ggcatccaca gctgcctcag gaggcggcag cgtggcctct tttgcagctt 
42121 ttctctcttc cagcatcctc tgcttgtgtt tggcccggca cttcctggca gaatggtggg 
42181 cccccaggtc ctgatagcgg tcctggtaag ccttggccac ctgggtgaat gctccacttc 
42241 cttggcccag ttctacctgt agttgtcttc ttcccactac tgacaggtct tgagccgctc 
42301 ctggataata ttgacgattt cttgatcgac tttgtagtcc ctcctccact gcatttcggc 
42361 ttcatatatg cacagtgttt cacgtgtcca tgtgaagaca cgtgagacca ccaagcaggc 
42421 tttgtgtgag caacaagcct gtttatttca cctgggcgca gggggctgag tcctaaaaga 
42481 aagccagcaa agggtggtgg attatcatta gttcttacag gttttgggat aggcggtgga 
42541 gtttggagcg atgttttgct ggcaggggct ggatctcaca aagtacattc tcaagggtgg 
42601 ggagagaatt gcaaagaacc ttcttcaggg tgggggagat tacaaagtac attgatcagt 
42661 tagggtgggg cagaaacaaa tcacaatggt ggagtgtcat cagttaaggc tattttcact 
42721 tcttttgtgg atctttagtt gcttcaggcc atctggatgt gtacgtgcca gtcacagggg 
42781 atatgatggc tcagcttggg ctcagaggcc tgacacacaa gatgttcttc tccttgcgct 
42841 cagtgatgcc cagcacgcgg cagtactgcc ggtggtagta gtaatacctg ttctttgcgt 
42901 gctgccactc tataaactct cacgagggtc gccgatcggt ccactaggag gtcgaaggcc 
42961 ttcatcaggt agacgacggg gttgggcagc caagtctgcg atgacagtgc tggcatgtgg 
43021 cgcgggggcg cagggtacac atccttgtct cacctgtccc acatggcagc agtgagcgcg 
43081 gacttcgctc ctggatgcgc tgccctggcc tccacctcgg tcccggtccc tactgattct 
43141 ttgtaatggt tgtgtaatat tacatggtgt ggacttaata aaggacccct acgcagccaa 
43201 gagagttttt ttccccaatg ggaaaacaga tttttttttt ttaatgtttg agattttagt 
43261 gaattatttt tacctggcaa ggagctgaac tacctgtact gttcagaagc caagttcagc 
43321 agtcttgtat gtgggatgtc tgtaaagatt aagaaagggg gcgggcgtgg tggttcatgc 
43381 ctgtaatccc agcactttgg gaggccgagg tgggtggatc acgaggtcag gagttcgaga 
43441 ccagcctggc caacaaaggg aaacccccgt ctctactaaa aatacaaaaa ctatccgtgc 
43501 gaggtggtgg gtgcctgtaa tcccagctac ttgggaggct gaggcaggag aattgcttga 
43561 acccgggagg tggaggctac agtaagcgaa gatcgtgcca ctgcactcca gcctggcgac 
43621 aaagcaagag tctctctcaa aaaaaaaaaa aaaaaaaaaa aaaaagaata agaaatggac 
43681 tctgccccaa ggagtctgtg ctcttaccag ggggcagagc cacacgtggc catgagacag 
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43741 ctgcccctgg cagctgcacg gaggagagct tgaccgagca tgcgggaggg agggcttgag 
43801 gctggcccgc tgggagggga cagctctggc aggaatcctg gaagggcagg tgtggagtca 
43861 gacatgatct gatgttctgt gtggccaagg cagaaggcca ggcaggaaaa acccggttac 
43921 taagaagatt ggttgagcag ccatctccct cccactgacc gccccccagg aatctgtata 
43 981 acttcagctt attttcagcc tttagtttcc cctcgttgga ggttgagtgc ctggtctctt 
44041 gattttctaa ctgaagaagg gatgaaagtc tgagatttca atcaaaagct taagaaaata 
44101 ataatccttg gttggcccaa ttctttcctg aaaatacctc acactaagag aaattgcctt 
44161 taagcccctt ttcttttctt tcatgcttta aatttctttc ttaattaaaa agagttgttt 
44221 acttgatgtg ggctccaaag tgcttattta ttttaacgta tcttgtttct tttttcctta 
44281 tcttcatgac ttgataactc tataactctc tcttcttgtg tcttaaaatg ccggtcgtgt 
44341 ctgtcactga catgtgataa ggaacttcat aatcagccct tttctattct cacagaatag 
44401 aagttgtgtc aataacaaat ggaaatcact gagctccaaa attcagctgc cctaacctgg 
44461 ctgcttgtcc ctatcagaga ttttagaatg ttttcccctt cagatgaatt tcagtttcac 
44521 tcaatcctac cagtgtttga tgacaacact catggaaaac taatggtaaa ttaatgtctg 
44581 ccttccaaag tacgtggttc agttcactgt gtaaaaaaat aatgataata atactaaagg 
44641 cagccaaact gctgattggc agaatttttt cccatcactt aaggcatttg agttgcagta 
44701 cgcagtctga gaatccctca gagacacacc acccagggac ttggatcctt gatctcagaa 
44761 agattgtgat cgaattaagt ctgactcaac aaataaggtc gctgggaaga agtgctggca 
44821 ttaaatttac cgttaacgcc agggtcaatt tactaaacag tcatcgacgt gtggttccac 
44881 aagacctctg ggtaaattcc agaaactgac tgggaggaag atagcttgcg catgaagtca 
44941 ctgaatcctg gtcagtgtga ctcttgatga tcatctctca gtacaccaga aagcatttcc 
45001 aaagacagga cgccacttgg cgtggaaagc ctttggaagc tatggaaggt gacttcttaa 
45061 gttaagtcta gaaacgggat ggcaggagga caagaaactg ggactctggt gccgggagcc 
45121 tggagtcctc attccactac agatgttaac tgggctcagc tattcaggtc actcagtggt 
45181 tcatcttaga gttgtttcta gactagatat ttattgccct gcattacaaa gacgggaaag 
45241 taagagttct gtccatcagc cagttgtcag gcgtaaatga cgtctttccc tcgcttcctg 
45301 cgccgcagtt tgcttatctg tcagatggag acgctactta cttacttatc tactcatcac 
45361 ctccaggact attcacatga ataagaagag aatatagata tgaatttggc atacagaggg 
45421 aaaagttcca cacaaataga aaacactagc ggaaataatt tctagattct tcctggtgaa 
45481 gcttttcaac tacttccaaa acacatcatt gctgagcctg tgcaaattca aatggcacaa 
45541 cccgaagagt cagtggcatg gcctcttctc tggccagagc cttgggggcc atcagcctgc 
45601 acctgcctgt ttgttctgcc attgcctcat gttctcttcc tcagacaccg gggaagccag 
45661 gatcacagag gcacttgtca gcttccttct gaaacagccc tcctcagctt ctctgctgtg 
45721 tctcccgggt cttctcacag aattcttggc cgggaagcca ccttccttaa aggaaatcaa 
45781 gtctcccgat ttcctagagc cagagtagca tagggggcag ctgtggggct tggagccaga 
45841 cagagcctta ttaattatcc atggatcttg ctgagccttg gtttctcatc tgtaaaatga 
45901 gggtaataaa gctactttcc tggtcatcta tgagtttgaa ataaaagaat atgggggcca 
45961 ggtgcagtgg ctcttgcctt taatcccagc actttgggag gctgaggcag gtggattacc 
46021 tgaggtcagg agttcaagac caccctggcc aacatagtga aaccctgtct ctactaaaaa 
46081 tacaaaaatt agccaagtat ggtggagggt gcctgtaatc ccagctacac gggagactga 
46141 gtcaggagaa tcacttgaac acaggaggtg gaggttgcag tgagctgaga tggcgccact 
46201 gcacttcaac ctcattgaca gagcaagact ccatctcaaa atgaagacaa aaacaaagaa 
46261 gagtatgggt gaaagggcct ggtgtgtcac gggagttgaa tgtgtgtcct tttccttttt 
46321 cccctgttcc ttccctgtgc agctgcctac caggcactgt ctcctctctc ctggggaggc 
46381 gctgcatttt ttccctgcaa aggaggtcca gaaagcagac ccagcccaat gtctgtgcat 
46441 ctgcctggag aaggcctcct gaggagcaca ccaacactct ccccttccat gttggccttc 
46501 tctgcactat tcaggttctt gcctctttgg aattgtgtgt atgtggaggg aacatctgga 
46561 ggtgtcactt aatttttcat gttacatctg gacctgtcac atagccacaa ccgtaactgt 
46621 acatggagaa tcatgtcgtg tgtgctgtgg acaagccaag aggtggcttt gtgtgctctt 
46681 ctgccacaca gaagagctgg gtgctgccca gcgcgggtgc tgggtgaccc cggaggaaag 
46741 agtcattccg agggcctgaa gaagggagga atcggtggct tgtaaggtgc actcccactt 
46801 cctgtggttt tcacaggcga tgacatgagg actggagatg aaatgaagca aaaccttctg 
46861 gggtcacttg agctatttct tttgagtcca cgtctctggg atggcatggc ctgtcagcca 
46921 cacccgtttc tctcctgttg gcagctccat tctgcagggg agcaagtgtc cagtcctggg 
46981 agcactggca gcagggatgc tcggaatgcc caacctgtcc tggggcctgc cttcgcctct 
47041 gcccctcagc tgccttgcca tctctcttcc tcctctttat ttttgtctgt ccctatggct 
47101 gtcagtgggg tcttgagtca cctatgtcat catatcagga acattgtgtg ttccaaagtg 
47161 tgcttggtgg aacacttgtt acttgaaaca gtgctttgta aaatggatcc tggggctagc 
47221 agctggagat gccgcatcct ctatctcctt ctcgtggatt tgcaattcag attctctcta 
47281 ttaaccctga agaatcccat aataaatggt ttaattttgc ttagtagttt cgcccaaacg 
47341 tttttgatca cggatccttt cgtatgaact cccaagaaac acttattagc agtgtttggt 
47401 gaatcacact ctgggtaaag cgggcatgaa caacatatta ttcgtaggaa tgctgtgacg 
47461 tcaaactcac cagtgaaaaa tctggcatca gagggttatg gaaaaatgcg tgtgcttttt 
47521 ttttttaata caaatatgac attggtgcag atcagacgtt cccacccctc tcaccacaaa 
47581 aagaaccttt ttagtccttt cctcaaggga ttccatgttt taaaggcttt taactaccta 
47641 acaaactaca atttattgaa taataattta tttctttgag ttttatttat caaagacatc 
47701 caacggccaa ttagggcttc tgatcagtcg agataaccat agttgccaca ctgagttgat 
47761 atgactccag cccaaagcaa ataaacttaa tgttttattt agtctgggaa cgtcattgag 
47821 gtaaatgtat ggtttcctct gggctttttg atatcttgaa aatagcttgc agacccccct 
47881 tgctgccttc atggagtgca ggataggtgg gaagcaggac catggatgga aatgcagagg 
47941 acgcacagct gttaggaggg cgggagggac ggggggaggg ggtggtgact ggggctacac 
48001 caggacactg gatagtagag actaagtggc cacagccact gtgaagactg ggggtccatg 
48061 ggttggggtg tccagatcct ggtgaacaag caagaagcca ctagccaaag gcacattcat 
48121 ggtgcaccct taatgtgggc aaaacctggg cacagcacct tccctctgta cgctggctgc 
48181 attgcaccca cccttccccc agcacccaga agaggcccag ggaggaggac agggcaaagg 
48241 ggagtgaggc aggcttcttt gacgggatac tttccccaga cctccctgat gcatttttca 
48301 atcacgagct actatgaggt ggatgtattg ccatggccac cttcagtgtc agtagagccc 
48361 ctgggagatt aaggcccctg aggattctgc ggtcggcttg ctcctcacct cgaaggagag 
48421 acctctgccc agggagctca tccttgctta gcagaacctg tgccactgtt catatggaaa 
48481 ggatgaggca ctagagccac attcaggctc atcaagcagt tctgtctcag atgatggaga 
48541 gccatgcgct ccctaaatag agggtttcct ggtgcattgg agatgggatt cgacatectg 
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48601 taatccactt tccacctgac tttacagaag agctgttact cgaagtaaga gagagggctc 
48661 ctccgcagga gaaaagtgct caggccatgc cctttgcagc cccattcagc ctccatgggc 
48721 tgggtcctaa gcccctcctg gacccccagt tactctcaca gccccagacc cctgcttagc 
48781 aatggccttg aactaaaacg catgatccta ctttttgcct cccctccctc attagtcagg 
48841 acagcatttt cccaaaagat acaatctgtg tctggcggaa gcctacttgc aaaaataaat 
48901 tcttccctga acattttcca ggcccatttt agaaataaat ctgggtgatg ccttttactc 
48961 atttccaaag ccaaacaacc tgaaccccga ggccttgttc tgcagtgtaa atgttacgag 
49021 gaggagggaa tcaggatcgg ctcctcagaa gaacccatct ccacctctag cagggaggct 
49081 gagcagtccc cacagctgta cccaccgcaa accactgctg ctcagatctg gggacacagg 
49141 tggatggagc tcacagcctg ggtcccagga cagcacagcc gcacccatcc ctctcaccct 
49201 gccaggctct cccgcagcag ggagccttcc tctggtttct gggtgcactg acggctgcag 
49261 cagctacaat ctccctgata cccattcatt cattcactca ctcactaaat atttctgctg 
49321 agcgcctgtt atgtgtcagg caccgtgcta agagctagaa atacagcagt gagcaagtgc 
49381 ccacggggcc tggcatatat ggggcactct actcacagca caatgaccaa gtgcatgagc 
49441 tggaaggcag gaggaaggga gaaaaggaag cgtctgtgca gtcagctccc tctgcttttg 
49501 tgtgcaccgt ctcattgaat cttcccaacc gcattccgtg attagaaggt gtccccatcc 
49561 ccatattcac agaaaaacag ccaagttcca aaggaaaggg gagaggtgga ttgatacagt 
49621 ttgtctgatt tcagagctcc gaactctctc atccacgcac actacaaagg cactcacaag 
49681 ctcacaaaca cacagggaca catattcacc cccacacata cctggactgt gcctccctcc 
49741 tccgggggta cccagtgatt gagccatgcg ttaacattct acttttcttt ctttttcttt 
49801 ttttttagac agagtctcgc tctgtcgccc aggctggagt acagtggtgc gatcttggct 
49861 cactgcaacc tctgcctcct gggctccagc gattctcctg tctcagcctc ccgagtagtt 
49921 gagattacag gcatgtgcca cctcacctgg ctaatttttg tattttttag aatagagaca 
49981 gggtttcacc atgttgatca ggctggtctc aaactcctga cctcaggtga tccgcccgcc 
50041 tcggcctccc aaagtgttgg gattataggc gtaagccact gtgcccggcc aacattctac 
50101 tttttttcat cataaccatg ttctgagcac ccgctccggc tcctgtatgc tcctgtctgg 
50161 gcccaggaac ccagagcaac aggggagact gagtccctgc ccttcagtca tgtgctgtct 
50221 agaaggaagc cactggctgt ggctactaaa cgtgtgaatt gtggttaact ctacattgac 
50281 atgtgctgtg agtgtgaaat acacactgag tttctaagac ttagtaagaa gaaaagaatg 
50341 caaaatattc catactttaa ccaggttgag taccccttat ctgaaatgct tcagaccaga 
50401 agtgttttgg atttcgcatt tgttctgttt ttggaatact atttgtgtat acataaatga 
50461 gatatcttgg ggatgggacc taagtttaaa catgaaattc atatatgttt tttatacacc 
50521 gtatatacat agcctgaagg taacttcata taaaatttta aataattttg ttcatgaaac 
50581 agaagttttg gctgcatgtt gactgtgacc tggaacatga ggtcaggggt ggaactctcc 
50641 ttttgcggtg tcatgtcagc actcaaacgc tttcggattc tggagaattt caaagctcag 
50701 attttttaat tcgggatgct caacctgtct tgatcacatg ttgaaattat acattttttg 
50761 gatatattgg gttaaagaaa aagtatgtaa aagataattt tccatgcttc ttttaaaaaa 
50821 atgtggtggc tacttacctg tttgagcttg caggagcgta ttttaaaaac gtggctacta 
50881 aaatatgtaa ggtcacatat gaggctcacg tgtgtgctca catgatattg cttagggaca 
50941 gtgtggctgg agatggaaag acaaaccata agcaagcgta cgcatgcgga acaagggtgc 
51001 ctgcgagtgg gggtacaggt cagggaggaa gcagtacagg gcgggtggat gccagcagtc 
51061 aggatgggtg gcctcacggg tggaggggaa gccaggagag ggaggagccg ggagtgagcc 
51121 cagtatgagt catttcttat cctgcgaggg actggggcct gggtcctgaa ggaaggaagg 
51181 aaggagatcc agggagctgt ataggcgcct gcctttattc tttccccgac tgagtgaggg 
51241 gtcctggcgc tgcaggggcc ctctggaggt caccacctcc atccagcctc tcagtttcat 
S1301 agggtcagtg tctgccacgg tcctgatctc gggttggcaa acggtcctca ggccctgccc 
51361 tcgcgggtgc tcactccaca ggtgcattaa tccagtgcct gtgatagtcc aagcctgctc 
51421 taagcgctgc ggaggtggtg gtgaccaatc aaagtgccca ctccagggca gctcacagca 
51481 agggtggggt ggctggagac agacaacagg caaaggaaca actacatgtg tcaggtgtgg 
51541 tcggtgaaaa gaaacatcaa gagggagcat ggatgggtgg gagctgtgtt agagagaagg 
51601 tgacatttca ggagggacct gaatgaattc agggatggag ccatgatcac accaaga 
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ggcacgagca atgggactta tcgctgctga tgttaacctt gatctcttgg ttcaggtggt 60 

gcctgccagc tgtctccact gtggagttac tatttttcct tttccccatt ttattcatca 120 

gaagccagtc actaagcgag gtcaaactcc aggacagggg aattaagtgc caccttctgg 180 

agagggagca ttcacattta ttacttggga tccttctgta aggaagagct gtttctcctc 240 

taaaaaactc tttaatcctt ttaagcctca atttcttaat tgtgaaatgg ggctaatacc 300 

tgtatccaac caagggagta gttagaaggt aacatgatag gtggaaagca cttaacatag 360 

gcaaaatgtt attatcagga atgatcgaga gacccatcca actatctgaa ggagtcactt 420 

aactctactg tactgcagcg ctgtaaagtc tgcatctttc actgggggta aaggccccca 4 80 

gtccctgaga cgggccagtt tggagacagg ctggtttttt ctctgttctc ctgagagccc 540 

ttcagatgag aagggaggtc tggagacaga atgccaaaag cccattaaag gcacggcctt 600 

gcatttcaga gagggagcag gtctagagaa gaaccagagg agctcagctg agatatggtg 660 

tatggattgg attttggtag aagatgggaa gaaccaaaca cctgagaaac cactttgaag 720 

atcggggtca gagtaaggcc taacacatag ttggctccca gtaattattg gttgattgaa 780 

cagctcaaag agcaactcga ccaagaacac tggactggga gtccagttac ttggatcttg 840 

cattcctgat ttatttttat tttatatgta ttttttctat ttttttgaga cgaagtctca 900 

ctcactctgt cgcccaggct ggactacaat ggcacgatct cggctcactg caaactctgc 960 

ctcccaggtt caagcgattc tcctgcctca gcctctcgag tagctaggat tacaggcatg 1020 

caccaccacg ctggctaatt tttgtatttt tagtagagac ggggttttgc catgttggcc 1080 

atgctggtgt ccacctcctg acctcagttg atcttcctgc ctcagccttc caaaatgttg 1140 

ggattacagg cgtgagccac cgtgcctggc cgtgatttat tttttttgtg tatgtttgtt 1200 

tttgtcaact tgctgtgtga ccttaagcaa gttacttaac ttctctgggc ttcactttcc 1260 

atggatgaac attgtaaaga ggctggagag agatgaggac taggtacagg ctttagagga 1320 

gagccaccgc cccggacttc tccctctgtc accccgcttt ccatgaccct ccttgcctga 1380 

ctttgtgact ccttgcctcg ctatcaaaac aagtgctgca atctcagtgc tttccaagag 1440 

ccctgcattg ttagaaactt cccagcacgc agcaaaggct gctgcaatac tcgctctgcc 1500 

tgcctttgcc ctgcgcttcc tacttaccct ccttttgttt ctcccaaaca tctgtccctg 1560 

actatgctca tctcatgttt gtcctcagct gctgaaaggg ccacgtttgt tttcattaca 1620 

M- aataagacca ccgagtgggc tcctggcgtg ggggcgggag cagccgcgcg cagtcttcag 1680 

rl aggcagcccc ccaggctgtc tctggagggt gtgtctctgc ttccctttcc ccgtgtttat 1740 

5j tttcagacga agccaagtgg cccgggggga ccctccggac tcccagcctt cagagaggag 1800 

fa* ggcagctcgg gctttcgccg cagtgcttcc tgcccgtcac gtgtgtgctc ctagccgggg 1860 

Iff tcgggggagc tggtatcttg gcccttctgg gaggacgcgc acagcccgag gaggcagagc 1920 

£i cccagacggg aatgggcttt tcagaggtgg ggtgcgggcg aggggacgat gcattatttt 1980 

taatatttga tttatttttc caactggact tcttcccggg gctctttctg ggcccagctg 2040 

£ cctttgtgat cccgcgcccc ggtcctcggc ctctcacctc cagcgccggg gcgccccctg 2100 

J V; ctgtcggaag cggctgtgac cgggcagagg tgctatctgg gactctgggt tctcagcccg 2160 

'y gggacagcga accgaggggc agatgatcca tcagaaaaga gccggcactg cccagccccg 2220 

jil cgcccctgcc cctgcctttt tccgggagcg cgccgcgccg cacccgctac ggccgcttga 2280 

ccccatcttt gagcccggcc ccaagctctg ggaccgtcgt gcccctcatc aaggaagagc 2340 

L caaggacccc aaggagaagg tcaggagcgg cggtgtggat gtcccttggc tgcaggcccc 2400 

P gccgcgcact cccttcagtc cttcccttct ctagggacca ggtagcatca gtgcctggat 2460 

y,, ctcggccttg tgtgccctgc tccctgcccc acctactaag aaccaagtct ggttcaccgg 2520 

ctcccaagag ctggaaccca ttctcagcta gctgggggcc caggccaccc cttccctcca 2580 

fsj gacctgtgtg ccttctgccc tggctccagg gccccccaca ccgtgaccag ggcgggatcc 2640 

m ctatggggct ggccagtcgg caccgtgcca ggcccacagt gccctgggcg tccatggaag 2700 

12 tcgttctgtg tctttaaaat cagaaggaag acattaacct ttaggctgaa gaaaatgttt 2760 

P tagtacacag caataactta tttgtcttta tccaacagcc ataaaatata actttaaata 2820 

f|| ttctattgat agagaaagga gttcatgaag gcagaaatgc ctggggccca cgaacatccc 2880 

a 9tgtggccc tggacgggac atcatgctgg gcaacacagc taaaatgcgg gtgaagacca 2940 

gatttcttgc acatggcggt gacgggatgc tccctagaga gcttcaagtg gattctttgc 3000 

tttttatttt ctctcttaat aaaaatgtat gatgtttaca ttgtcagaga aaaaaaaaaa 3060 
aaaaaaa 3067 
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gcaatgggac ttatcgctgc tgatgttaac cttgatctct tggttcaggt ggtgcctgcc 60 

agctgtctcc actgtggagt tactatttty ccttttcccc atttwattca tcagaagcca 120 

gtcactaagc gaggtcaaac tccaggacag gggaattaag tgccaccttc tggagaggga 180 

gcattcacat ttattacttg ggatccttct gtaaggaaga gctgtttctc ctctaaaaaa 240 

ctctttaatc cttttaagcc tcaatttctt aattgtgaaa tggggctaat acctgtatcc 300 

aaccaaggga gtagttagaa ggtaacatga taggtggaaa gcacttaaca taggcaaaat 360 

gttatkatca ggaatgatcg agagacccat ccaactatct gaaggagtca cttaactcta 420 

ctgtactgca gcgctgtaaa gtctgcatct ttcactgggg gtaaaggccc ccagtccctg 480 

agacgggcca gtttggagac aggctggttt tttctctgtt ctcctgagag cccttcagat 540 

gagaagggag gtctggagac agaatgccaa aagcccatta aaggcacggc cttgcatttc 600 

agagagggag caggtctaga gaagaaccag aggagctcag ctgagatatg gtgtatggat 660 

tggattttgg tagaagatgg gaagaaccaa acacctgaga aaccactttg aagatcgggg 720 

tcagagtaag gcctaacaca tagttggctc ccagtaatta ttggttgatt gaacagctca 780 

aagagcaact cgaccaagaa cactggactg ggagtccagt tacttggatc ttgcattcct 840 

gatttatttt tattttatat gtattttttc tatttttttg agacgaagty tcactcactc 900 

tgtcgcccag gctggactac aatggcacga tctcggctca ctgcaaactc tgcctcccag 960 

gttcaagcga ttctcctgcc tcagcctctc gagtagctag gattacaggc atgcaccacc 1020 

acgctggcta atttttgtat tttwagtaga gacggggttt tgccatgttg gccatgctgg 1080 

tgtccacctc ctgacctcag ttgatcttcc tgcctcagcc ttccaaaatg ttgggattac 1140 

aggcgtgagc caccgtgcct ggccgtgatt tatttttttt gtgtatgttt gtttttgtca 1200 

acttgctgtg tgaccttaag caagttactt aacttctctg ggcttcactt tccatggatg 1260 

aacattgtaa agaggctgga gagagatgag gactaggtac aggctttaga ggagagccac 1320 

cgccccggac ttctccctct gtcaccccgc tttccatgac cctccttgcc tgactttgtg 13 80 

actccttgcc tcgctatcaa aacaagtgct gcaatctcag tgctttccaa gagccctgca 1440 

ttgttagaaa cttcccagca cgcagcaaag gctgctgcaa tactcgctct gcctgccttt 1500 

gccctgcgct tcctacttac cctccttttg tttctcccaa acatctgtcc ctgactatgc 1560 

tcatctcatg tttgtcctca gctgctgaaa gggccacgtt tgytttcatt acaaataaga 1620 

ccaccgagtg ggctcctggc gtgggggcgg gagcagccgc gcgcagtctt cagaggcagc 1680 

cccccaggct gtctctggag ggtgtgtctc tgcttccctt tccccgtgtt tattttcaga 1740 

cgaagccaag tggcccgggg ggaccctccg gactcccagc cttcagagag gagggcagct 1800 
cgggctttcg ccgcagtgct tcctgcccgt cacgtgtgtg ctcctagccg gggtcggggg ' 1860 

agctggtatc ttggcccttc tgggaggacg cgcacagccc gaggaggcag agccccagac 1920 

gggaatgggc ttttcagagg tggggtgcgg gcgaggggac gatgcattat ttttaatatt 1980 

tgatttattt ttccaactgg acttcttccc ggggctcttt ctgggcccag ctgcctttgt 2040 

gatcccgcgc cccggtcctc ggcctctcac ctccagcgcc ggggcgcccc ctgctgtcgg 2100 

aagcggctgt gaccgggcag aggtgctatc tgggactctg ggttctcagc ccggggacag 2160 

cgaaccgagg ggcagatgat ccatcagaaa agagccggca ctgcccagcc ccgcgcccct 2220 

gcccctgcct ttttccggga gcgcgccgcg ccgcacccgc tacggccgct tgaccccatc 2280 

tttgagcccg gccccaagct ctgggaccgt cgtgcccctc atcaaggaag agccaaggac 2340 

cccaaggaga aggtcaggag cggcggtgtg gatgtscctt ggmtgcaggc cccgccgcgc 2400 

actcccttca gtccttccct tctctaggga ccaggtagca tcagtgcctg gatctcggcc 2460 

ttgtgtgccc tgctccctgc cccacctact aagaaccaag tctggttcac cggctcccaa 2520 

gagctggaac ccattctcag ctagctgggg gcccaggcca ccccttccct ccagacctgt 2580 

gtgccttctg ccctggctcc agggcccccc acaccgtgac cagggcggga tccctatggg 2640 

gctggccagt cggcaccgtg ccaggcccac agtgccctgg gcgtccatgg aagtcgttct 2700 

gtgtctttaa aatcagaagg aagacattaa cctttaggct gaagaaaatg ttttagtaca 2760 

cagcaataac ttatttgtct ttatccaaca gccataaaat ataactttaa atattctatt 2820 

gatagagaaa ggagttcatg aaggcagaaa tgcctggggc ccacgaacat cccagtgtgg 2880 

ccctggacgg gacatcatgc tgggcaacac agctaaaatg cgggtgaaga ccagatttct 2940 

tgcacatggc ggtgacggga tgctccctag agagcttcaa gtggattctt tgctttttat 3000 

tttctctctt aataaaaatg tatgatgttt acattgtcag agaaaaaaaa aaaaaaaaaa 3060 

ctcgtagggg gggcccgtac ccaatcgcct gtgatgatgg tatac 3105 
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Lyl488 Protein Sequence ^ff^^D ND^ i Oj^l/Q 

ELLKSIWYAFTALDVEKSGKVSKSQLKVLSHNLYTVLHIPHDPV 

ALEEHFRDDDDGPVSSQGYMPYLNKYILDKVEEGAFVKEHFDELCWTLTAKKNYRADS 

NGNSMLSNQDAFRLWCLFNFLSEDKYPLIMVPDEGDEGNHPSPEPVPSTKHPNKTQDP 

PESPKQSVPKSCWGRLWEPDRALPGVGAGNTTCCSYQAFLLLLQVEYLLKKVLSSMSL 

EVSLGELEELLAQEAQVAQTTGGLSVWQFLELFNSGRCLRGVGRDTLSMAIHEVYQEL 

IQDVLKQGYLWKRGHLRRNWAERWFQLQPSCLCYFGSEECKEKRGI I PLDAHCCVEVL 

PDRDGKRCMFCVKTANRTYEMSASDTRQRQEWTAAIQMAIRLQAEGKTSLHKDLKQKR 

REQREQRERRRAAKEEELLRLQQLQEEKERKLQELELLQEAQRQAERLLQEEEERRRS 

QHRELQQALEGQLREAEQARASMQAEMELKEEEAARQRQRIKELEEMQQRLQEALQLE 

VKARRDEESVRIAQTRLLEEEEEKLKQLMQLKEEQERYIERAQQEKEELQQEMAQQSR 

SLQQAQQQLEE VRQNRQRADEDVE AAQRKLRQASTWKHWWQMNRLMHP I EPGDKRP 

VTSSSFSGFQPPLLAHRDSSLKRLTRWGSQGNRTPSPNSNEQQKSLNGGDEAPAPAST 
PQEDKLDPAPEN " 

Lyl488 DNA Sequence (mRNA) 

1 gaactgctca agtccatctg gtacgccttt accgcgctgg acgtggagaa gagtggcaaa 
61 gtctccaagt cccagctcaa ggtgctgtcc cacaacctgt acacggtcct gcacatcccc 
121 catgaccccg tggccctgga ggaacacttc cgagatgatg atgacggccc tgtgtccagc 
181 cagggataca tgccctacct caacaagtac atcctggaca aggtggagga gggggctttt 
241 gttaaagagc actttgatga gctgtgctgg acgctgacgg ccaagaagaa ctatcgggca 
301 gatagcaacg ggaacagtat gctctccaat caggatgcct tccgcctctg gtgcctcttc 
361 aacttcctgt ctgaggacaa gtaccctctg atcatggttc ctgatgaggg tgatgaaggg 
421 aaccacccga gccctgaacc agtgccctct actaaacacc caaacaagac ccaggatccc 
481 ccagaaagtc ctaaacagag tgtcccaaaa agctgctggg gcaggctctg ggagccagat 
541 agagcactcc ctggtgttgg tgctggcaac accacctgct gcagctacca ggccttcctt 
601 ctcctgctcc aggtggaata cctgctgaaa aaggtactca gcagcatgag cttggaggtg 
661 agcttgggtg agctggagga gcttctggcc caggaggccc aggtggccca gaccaccggg 
721 gggctcagcg tctggcagtt cctggagctc ttcaattcgg gccgctgcct gcggggcgtg 
781 ggccgggaca ccctcagcat ggccatccac gaggtctacc aggagctcat ccaagatgtc 
841 ctgaagcagg gctacctgtg gaagcgaggg cacctgagaa ggaactgggc cgaacgctgg 
901 ttccagctgc agcccagctg cctctgctac tttgggagtg aagagtgcaa agagaaaagg 
961 ggcattatcc cgctggatgc acactgctgc gtggaggtgc tgccagaccg cgacggaaag 
1021 cgctgcatgt tctgtgtgaa gacagccaac cgcacgtatg agatgagcgc ctcagacacg 
1081 cgccagcgcc aggagtggac agctgccatc cagatggcga tccggctgca ggccgagggg 
1141 aagacgtccc tacacaagga cctgaagcag aaacggcgcg agcagcggga gcagcgggag 
1201 cggcgccggg cggccaagga agaggagctg ctgcggctgc agcagctgca ggaggagaag 
1261 gagcggaagc tgcaggagct ggagctgctg caggaggcgc agcggcaggc cgagcggctg 
1321 ctgcaggagg aggaggaacg gcgccgcagc cagcaccgcg agctgcagca ggcgctcgag 
1381 ggccaactgc gcgaggcgga gcaggcccgg gcctccatgc aggctgagat ggagctgaag 
1441 gaggaggagg ctgcccggca gcggcagcgc atcaaggagc tggaggagat gcagcagcgg 
1501 ttgcaggagg ccctgcaact agaggtgaaa gctcggcgag atgaagaatc tgtgcgaatc 
1561 gctcagacca gactgctgga agaggaggaa gagaagctga agcagttgat gcagctgaag 
1621 gaggagcagg agcgctacat cgaacgggcg cagcaggaga aggaagagct gcagcaggag 
1681 atggcacagc agagccgctc cctgcagcag gcccagcagc agctggagga ggtgcggcag 
1741 aaccggcaga gggctgacga ggatgtggag gctgcccaga gaaaactgcg ccaggccagc 
1801 accaacgtga aacactggaa tgtccagatg aaccggctga tgcatccaat tgagcctgga 
1861 gataagcgtc cggtcaccag cagctccttc tcaggcttcc agccccctct gcttgcccac 
1921 cgtgactcct ccctaaagcg cctgacccgc tggggatccc agggcaacag gaccccctcg 
1981 cccaacagca atgagcagca gaagtccctc aatggtgggg atgaggctcc tgccccggct 
2041 tccacccctc aggaagataa actggatcca gcaccagaaa attagcctct cttagcccct 
2101 tgttcttccc aatgtcatat ccaccaggac ctggccacag ctggcctgtg ggtgatccca 
2161 gctcttacta ggagagggag ctgaggtcct ggtgccaggg gcccaggccc tccaaccata 
2221 aacagtccag gatggaacct ggttcaccct tcataccagc tccaagcccc agaccatggg 
2281 agctgtctgg gatgttgatc cttgagaact tggccctgtg ctttagaccc aaggacccga 
2341 ttcttgggct aggaaagaga gaacaagcaa gccggggcta cctgccccca ggtggccacc 
^ 2401 aagttgtgga agcacatttc taaataaaaa ctgctcttag aatgaa 



Figure 27 



TMpred Report for Ly1488 (sesx&tto \ io,*u*0 



ELLKSIWYAFTALDVEKSGKVSKSQLKVLSHNLYTVLHIPHDPVAIiEEHF 
RDDDDGPVSSQGYMPYLNKYILDKVEEGAFVKEHFDELCWTLTAKKNYRA 
DSNGNSMLSNQDAFRLWCLFNFLSEDKYPLIMVPDEGDEGNHPSPEPVPS 
TKHPNKTQDPPESPKQSVPKSCWGRLWEPDRALPGVGAGNTTCCSYQAFL 
LLLQVEYLLKKVLSSMSLEVSLGELEELLAQEAQVAQTTGGLSVWQFLEL 
FNSGRCLRGVGRDTLSMAIHEVYQELIQDVLKQGYLWKRGHLRRNWAERW 
FQLQPSCLCYFGSEECKEKRGIIPLDAHCCVEVLPDRDGKRCMFCVKTAN 
RTYEMSASDTRQRQEWTAAIQMAIRLQAEGKTSLHKDLKQKRREQREQRE 
RRRAAKEEELLRLQQLQEEKERKLQELELLQEAQRQAERLLQEEEERRRS 
QHRELQQALEGQLREAEQARASMQAEMELKEEEAARQRQRIKELEEMQQR 
LQEALQLEVKARRDEESVRIAQTRLLEEEEEKLKQLMQLKEEQERYIERA 
QQEKEE LQQEMAQQ S RS LQQAQQQLEEVRQNRQRADEDVEAAQRKLRQAS 
TNVKHWNVQMNRLMHPIEPGDKRPVTS S SFSGFQPPLLAHRDS SLKRLTR 
WGSQGNRTPSPNSNEQQKSLNGGDEAPAPASTPQEDKLDPAPEN 

Black = intracellular , Red = Transmembrane, Blue = Extracellular 

Lyl488Rp3-329A5Chr6 has 756 amino acids and 1 Transmembrane Domains 

Transmembrane Domain 1: 199 - 220 Score: 1.3061 
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Lyl449 and Lyl480 matches Lung cancer associated 
polynucleotide sequence SEQ ID ID,4*1<S 

cttaaagagg taatttagcc atcattctta tgccagcaga tataaataaa cttggaccca 60 

tctggtcttc agctaaacct gagacatttt aaagtgcatg gacagccatg gacagcaggc 120 

cctgctctaa caggggatgc aaggcatgga gaaagacaat cagtacccaa gctcagccac 180 

agaagacagg agtcactcat ataacttgtg tttagaagtt tttggtagcc acgcacactt 240 

tctgaaatca cactatctgg tggtttaatc atatttttaa agacagaatc cctgagtgct 300 

gagcagattc tcaaaacaca tttagaatcc ctgaaattag aaagatcaat gacaaaatat 360 

ctgtcagcca ggccacaaac aggtgtaaaa ttatgaaagg agtggttgga tgtgccaagt 420 

ttggtaaagt ggtgactgca tctgagaaag aggctgtgag gctgaactct tggtggcttc 480 

cttctgtaac ttccagaggg agtcttcaac acaggccccg tgctcgtagg aatacggtag 540 

cacctatgta ggaagtgcgt ggagttttct gtcttctttc tgtgtgattt ttggcctttt 600 

tatcagcact tctcccctcc cagsagcctg gggatgccaa acatccagaa tgtgatggga 660 

caagatgggg gcaggggcct cacctccctg cagaggtccg gccaggtctc cttgtccctg 720 

gacaatctcc tgagcctctc tgcttggtgg agcaggcacc tgtgtgcaga attcccactg 780 

tggccagcac gaggaagtct tttctagtga aaatgtgtct tgtggtcagg aataattatc 840 

ctttcccctg tagccaccaa ggagggcaaa tagagaaagg taacctaatt gaaggattgg 900 

tcatgtgaaa agggctacat ttgggaagct gggaaaggcc tccaggcttc tagagcagct 960 

agcttgggct ggattctcay acccaggctg ccccttggat tgttctaccc aagcttttcc 1020 

ctggggtctg ggctcactcc ataaggtaag gtgcctttta ccttatggtc cttctttagc 1080 

aggtaacaaa ggagcatcag gggcaggctg ccctggtggc atcacactgg ctagtgaggc 1140 

cgtgaatatc ttgtccccca gcagggccga cagtttctat cacagaaaac agtgtgttca 1200 

gtggtgaaaa tcgttgcatg catgttttca tctgagcgtg tccttctccc atactcccta 1260 

tcagccagcc ctgcctgtag ctgctgtatg gtgattgcac ttggacatca gtccaatgac 1320 

tgcaagtcgg cctggatttt cacttgcaga ggctacagct gcattgtcag gtctcccagc 1380 

cctgcagaga gctccctcca ctggttagca gtgtgttgtg ttttccattc atttcagaag 1440 

agctacattg tgtcactgga catttttaaa aactgtgatt tttaataaaa atttaaaatt 1500 

tgaaaaaaaa aaaaaaaaac ctcgggggta acttttrggg gggccggggc ccwtgcgttt 1560 

1561 
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Homo sapiens chromosome 17, clone RP11-956N15, complete sequence (bp1-10 000) 



1 tgtatgtatg tgtgtggtgt gtgaaggtgt atgtggtatg tgtttggtat gtgtatatgt 

61 gtactgtgtg gtgcatgcat gttctgtgta tttgtgtgca tgtgtatgta tgtgtgcccg 
121 tctctgcata tgtgtatgta tatgtgtgca tgtgggtaca ggtgtatgta catgcatgca 
181 catgtgtgca catgtgtgaa tgcatgtgta catttgtgca tgtgcttatg tgtgtggatg 
241 catgtgttca tgtaaatgca tgcatttgtg cccatgcaca tgtgtatgta catgtgtgca 
301 tatacatgta tgcactgaca tatatgcatg tgtgcatatg tacacgtgtg catgccttct 
361 gtgcatgtgt gaatgcatgt gtgcatgtgg atgtatatgt gtgcatgtct gcgcatgtgt 
421 gtaacctcct tagaacaggc agaaattggg gctctggaat cctttctttg cctaccgcag 
481 ttccttttag gctgtcttca tagagaaagg gatagctcaa aacccacagc cctgctttgg 
541 cctgatgggg gatttctggg tctcctcagt ctgtctttta ttaggcaggc catgggctgt 
601 caggccctgg ctgggtagat gctctgctca tgaataaaag atagaggcag ggcaggacag 
661 ggccctccct gatgggcctc cccgccgttc tgtgtggtgg aatctcacag tcaactttga 
721 cctgtgcaca tccacttttt tttttgagat aggatcttgc tctgtcaccc agctgggagt 
781 gcagtggtgc catcacagct catttcagcc tcgacctccc aggctcaagc aatcttcctg 
841 ccccagcctc ccaagtagct gggactacag gtgcatacca ccacacccgg ctaatgtttt 
901 gatttctgca gagctaaagt cttactatgt tgtccaggct ggtctcaaac tctgggctca 
961 agagatcctc ctgccttggg ctcccaaact gctgggatta caggcatgag ccactgcacc 
1021 cggccaacac attcactttt ttgggacatg gcagggactt aatgttttag aaaacattta 
1081 gctacccctt tgacaatgct gcttgacact atttgacagt gtgacttacc acatcctaat 
1141 aacttccatg aaaatcatca caaagcaaat tttaaatttt tattaaaaat cacagttttt 
1201 aaaaatgtcc agtgacacaa tgtagctctt ctgaaatgaa tggaaaacac aacacactgc 
1261 taaccagtgg agggagctct ctgcagggct gggagacctg acaatgcagc tgtagcctct 
1321 gcaagtgaaa atccaggccg acttgcagtc attggactga tgtccaagtg caatcaccat 
1381 acagcagcta caggcagggc tggctgatag ggagtatggg agaaggacac gctcagatga 
1441 aaacatgcat gcaacgattt tcaccactga acacactgtt ttctgtgata gaaactgtcg 
1501 gccctgctgg gggacaagat attcacggcc tcactagcca gtgtgatgcc accagggcag 
1561 cctgcccctg atgctccttt gttacctgct aaagaaggac cataaggtaa aaggcacctt 
1621 accttatgga gtgagcccag accccaggga aaagcttggg tagaacaatc caaggggcag 
1681 cctgggtatg agaatccagc ccaagctagc tgctctagaa gcctggaggc ctttcccagc 
1741 ttcccaaatg tagccctttt cacatgacca atccttcaat taggttacct ttctctattt 
1801 gccctccttg gtggctacag gggaaaggat aattattcct gaccacaaga cacattttca 
1861 ctagaaaaga cttcctcgtg ctggccacag tgggaattct gcacacaggt gcctgctcca 
1921 ccaagcagag aggctcagga gattgtccag ggacaaggag acctggccgg acctctgcag 
1981 ggaggtgagg cccctgcccc catcttgtcc catcacattc tggatgtttg gcatccccag 
2041 gctcctggga ggggagaagt gctgataaaa aggccaaaaa tcacacagaa agaagacaga 
2101 aaactccacg cacttcctac ataggtgcta ccgtattcct acgagcacgg ggcctgtgtt 
2161 gaagactccc tctggaagtt acagaaggaa gccaccaaga gttcagcctc acagcctctt 
2221 tctcagatgc agtcaccact ttaccaaact tggcacatcc aaccactcct ttcataattt 
2281 tacacctgtt tgtggcctgg ctgacagata ttttgtcatt gatctttcta atttcaggga 
2341 ttctaaatgt gttttgagaa tctgctcagc actcagggat tctgtcttta aaaatatgat 
2401 taaaccacca gatagtgtga tttcagaaag tgtgcgtggc taccaaaaac ttctaaacac 
2461 aagttatatg agtgactcct gtcttctgtg gctgagcttg ggtactgatt gtctttctcc 
2521 atgccttgca tcccctgtta gaggagggcc tgctgtccat ggctgtccat gcactttaaa 
2581 atgtctcagg tttagctgaa gaccagatgg gtccaagttt atttatatct gctggcataa 
2641 gaatgatggc taaattacct ctttaagttg tttttttgtt tgtttgtttt gacagagtct 
2701 cgctgtgatg cccaagctgg agtgcagtgg catgatcttg gctcactgcg acctccgttt 
2761 cctgggttca agcgattctc ctgcctcagc ctcccgagta gctgggacta cagacacatg 
2821 ccaccatgcc cggctaattt ttgtattttt agtagagatg gggttttacc atattggcca 
2881 ggctagtctc aaactcctta cctcaagtga tccacccgcc tcggcctcct aaagtgctga 
2941 gatgtctctt taactttcaa gaggtacagc aaaactgaaa tcaatgggtg atgacttctt 
3001 aaagatcaaa catgaagatg aggattagtt aagaccagaa aatcattctc cgatcttgta 
3061 tcttcaactc aaattcacac tccaaaaggc ctgtttgtgg tgaggctcag ggtctttggc 
3121 ttgttctgaa ttaatacttt ctttggaaac tctttagatg gctttgatcc ctgtgctgcc 
3181 ccccattgca gacacagaaa gcgacagggg attctggcgg caccctctct gtggaggaga 
3241 ggaaggtgtg gaccacgttc agaggaaggc agcctgaagc tgtcctcagt gagatgctgg 
3301 gggatgggcc tcttgcccct gctgtgcagc ctccatgcag ggctttattt accagtcacc 
3361 aggtcttcaa gacagtctgc aggagagagg atttcagggg gaaagtaagc caagccaatt 
3421 cattctcatg gtcccctttt atcacaaaca tgtaagtctt ccatctcata acagagacag 
3481 caaaagcagt atccaggcct tttttttttt tctttctttc tttccttaac tggctacatg 
3541 cttagaaact gcactggtca aacttgattt tctttttaaa gcctcaaaac atttttattg 
3601 tcaggaaagc ttttcagtgg ccagggatca gtctcatggc cgtagaagca gccaaattcc 
3661 tctgcctttg ccttcccttc aggagtcaca tgctaaggca tccttgggca tttggaaaag 
3721 gccgcttggg gtgagagtgc tctaggccac tctgcaatgt ccctgggccc gatgagtaac 
3781 aaatgcaccc cgggacccag agaagtggaa agacatgaaa gggatttgga aacagatcgt 
3841 aaaaataacc tgtatgaaaa tcacacagac aagaaaacaa acaaatccag ctagcccctt 
3901 ggatcctgcc atccgtcttt tctcccggga ctgcctttgc tttgtggagg gtgagtcctg 
3961 ggacaggctt tgggtgacaa gttgtgggca tgttccatcc aagcctggct ctcccgactg 
4021 gcagagggag gtggctgaca atcaattcct ctggatacat tttcctgtga ttgctcctgt 
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4081 gatcaaaggc agccaccgac aggtgcggga 
4141 tgcgcagccg cagttccact gctgcgaatc 
4201 gatgcattct tgtttgtcaa tgtgcttgtg 
4261 caggctgagt accagtggtg ctgctccctg 
4321 cgtggtccct gtgcccccca ccccagggtg 
4381 tactgtagac ctaaataacc catatctgca 
4441 caccctaaag taggagactt tttgcattag 
4501 tatgcggcac acaatatgct tctgtgccac 
4561 ttttagcaaa acaggtgtct gaaaatactt 
4621 tgactcataa tcttgttaag agaagaaata 
4681 ctctaatcga tggctctgga ttttctgaaa 
4741 ctcaaaacat gtcatggcag gagcaaagac 
4801 agccagtcct tactgccact gctggccagg 
4861 aagcattaac agttcgcagt ctctaaacaa 
4921 acagacatcc tgtacagatc agttggaagt 
4981 cttagaagtt atttttctta ttttgcaaga 
5041 gaaatacaat aagtgaaaaa gtgaaacaga 
5101 cattcagcca ttttgaccta tatcaatggc 
5161 gaacagccat agaacgcgca tcaggaccaa 
5221 cttctacaaa tcaaaatggg tagctttgtc 
5281 tagctttcaa tagtgagcaa aatgataaaa 
5341 ctgtggtcat gtctggggac acgggcctcc 
5401 ggccaccaag tgtgttccga gaggggatat 
5461 gtccacccag agggacagcc tctgtcttgc 
5521 cttcagcatc tttgcaaaac agggggctct 
5581 agagcacctt tggatcttgc tctgggggtt 
5641 tgcagggaaa caggaataca gactcgtcct 
5701 aaatctctcc tgtacatcca tgggagatgc 
5761 ttcagcagca ggagggtgat gcagtgccct 
5821 ctgccactct gctttcgggc cagagtgcag 
5881 ctcctcctct cccctcatcc ctgcactcaa 
5941 caatccctaa aggcggccta tctggagccc 
6001 aattttctgt tctcttttaa agtttccagt 
6061 tttatcctgg gttcctggca cagagctcct 
6121 tgaaagcagc atcttttgtt actcataata 
6181 taatgagact atttctctcc ctccctccct 
6241 ctctctctgt ctgtctctat atatcacctt 
6301 aaatgaaaaa acaaagcaaa acaaaacaaa 
6361 ccttcattga atccaccaca ttctagaagc 
6421 tgagaaacta cttcgacacg agagggctta 
6481 caggcccctg tgccctcctt gcttagaatt 
6541 atcataggac acacttattt ttagcttatt 
6601 ataaggtctg atagggacag ggtctgaggc 
6661 ctgcatagca ggagctccgg cagtggccac 
6721 ggatgtgccg ctcagaggaa agccttggag 
6781 tcaccaaggt cctgggaatt catgttttat 
6841 tgagccgcat ctagagcctc cctgactctc 
6901 cattcggcag aggcccccac tggagggctg 
6961 agagcaccaa cttctctcca tttgcaaggg 
7021 agcatgactt ggctctggaa aacactacag 
7081 cttagggcca gtgtttcctt cctggctttc 
7141 caggactctg atgttgtgtt tcctctttgg 
7201 gccatcccaa aatatgccag attgggatat 
7261 attgtagttt cggataggac taactgacct 
7321 attcctctgg gaaggagacc cttcctgtag 
7381 gactgggcat tggagctgca atgggcctga 
7441 ttccactggt tttacaccct cctcatatct 
7501 tctagccaga ttttatttgt cctgtcattt 
7561 attataaaag catgttgttt tggccacttc 
7621 ataaacatgt aaaactagta aaattggtaa 
7681 aatttggttt ctaggtccag ctgaagagtc 
7741 ctctggctgt cttacaaggg tgagggatgc 
7801 gctgggaagt ccctttgaat tccctccagg 
7861 cagtgaggga ccccctgcat cttctggagg 
7921 cagcccttgg cgcttctgag ttctccttgt 
7981 tttcattcaa ctgcacctca ttgaatttct 
8041 aagaaattat ttaggcagat agtgagggta 
8101 aaaaagcagc tcctaaatca tgtcttttat 
8161 aagcatagct aagcaagcta aaagctcaca 
8221 aaagggatcc ctggagccag ggatattcaa 
8281 tgccacgtgt gcagtaaaaa agcaggcaac 
8341 gagatcgaga ccatcctgac taacacggtg 



gcggttagtt caccacgtca tagtagtaat 
cgggcctgtt ttccacactg gaatggaaga 
aaaaattgct gatcacaagc agggcctctg 
gtggctaact ctagggcctg cagctcacag 
accacaaggc tgtcaaaaat agtgtcggcc 
ctttaacaag acctcccggt gattcatgtc 
aactgggagt gttttagccc caggttcaga 
aggcaacacg aagaggctgt cactctgtga 
ggcaaatgtt ctgcagatat agtcaacatc 
cgaacataac tttaattctc cctttccacg 
atgggctgtt gatatgggtt cttaggaaag 
ccagggcttg tctatgatcc accttaatat 
gaggcactaa tgtcagtaaa ctatgggcaa 
acacttacag acacttctgt tttgtcttac 
aagtactaaa accctgataa tcgatcctca 
tttctggata tattgcaagt gggtgaagtt 
atgtgaggtt aaattgtagg aaaccgtcca 
aatttcatat gattcaatgt cgacacaaga 
gactcactca tcattataag agcagagtgg 
aaatggcatc aaaatacatg gtttctctta 
ctgctcgaat tgatcagtgt ggtcagccat 
cagggtgtca ctgagagggg ctttgtctct 
ggtggccaag cctctggtac aggagcatgt 
tcctaaactt ctacagtcaa agagaacaca 
gctccaactg tggggatcca aatggtacaa 
catctgccca gcagccaagg agcaaacaaa 
caggaaaaac acctggtcag gtctcctagg 
tgcctccagg aagctgggac acagctgtgc 
tgcgtgccca gcttcctggc ctgccctggc 
attctgctct gctccttgta cccagtgacc 
ccttgcttct tagcctgcag gtctcagctg 
ctgctggaag tgtgaggctt ccctccccat 
gtgtgatttt atataaagag atatatggtc 
aaaacccttg gaatttcctg attgttgggc 
aacccccttc aaccatacct gggtttgtgc 
ccttctctcc cttcatctac ctccctcccg 
ggagagagtg gctgtttact actgaggaaa 
acagagaata tactaggacc acctgtagtc 
tccaatgtgc ttctgtgctg caggcaactg 
aaaaacccca tatatacacg tgatatcctc 
cccactggag tcatcccttt cctaagacac 
gtttattgcc tgtctgcccc agccccaact 
ttctgtgagg cttgcacctg gggtggcagc 
cgaccctgtg aatgacccag ggctctcaga 
ctgaggttgt gcagggagcc aggggacatc 
cttgggcttg ttgatcttta aacaggaatt 
agagaacctg gagagcccag gaaatttctc 
ggactattag gaaaccgttt ttcaggatca 
ctttccccac tggtgtcttt aatcacccca 
tggaaggctt cctgactgat tcctgctttt 
acgctgcctc cttgagtgac ctgcacaggg 
aaagtgagag ggggaaggag tgaagcatgg 
tgattatttt gagttgaaaa tattggatac 
gtctcttcat gcctgcagca agccataaag 
caggatgaga aaatgcccct tatcatcaga 
gtaaacatat ttaacacagt atcccttatc 
cctagcgcct tctctagaaa atttactcct 
ctccttaaat gtatggttct ttgtctaaag 
tttagacttc actgtcttgt gaggatcccc 
gcttttcttt tgttaatctg gcctggtatc 
cattaagact aaaaaggggt tggaggtaat 
tgtctgcctt ctcagggctt tgtcattgag 
accacaccct ggaactgatg acaatgcctg 
gcctcaagca atagcctggg ttaccaggac 
ggccctgctc caaggaatct. tttccttttt 
tatgtttaga caaagtgatt cagagctaga 
acagagtcct ctgtaaggtt tccttgcaat 
aacaaaaagc agcctgaaaa atcaagctgc 
taggtaaatg ccggcagctg tgtcaataga 
catggaggtt ccctcttcca ctttccttgt 
aaggctgagg cgggcggatc atgaggtcag 
aaaccccgtc tccactaaaa aatacaaaaa 



8401 aattagctgg gcgtggtggc gggcgcctgt 
8461 agtatggtgt gaacccggga gacagagctt 
8521 cagcctgggc gacagagtga gactccgtct 
8581 catggcgtcg gccaaggaga gacctcatct 
8641 gtttcttttc gtgttatgta aatggcctgg 
8701 tcagactcaa gctcgtctat aaaacccttg 
8761 gggagccctt gtctctctgc aggaaagaga 
8821 tctgctctta aactcacttc ttgtgtgtct 
8881 gagctcaggt attaccccag acaaacacag 
8941 cctcccccaa ccctgtttga ggtcactact 
9001 ttgttgggaa ataaatcaaa gtctgataaa 
9061 cattgtcatg atttcctttt gactggggct 
9121 cagaactcac tgaaatgcac ccctgcagag 
9181 attgcctgcc tgtgaacctc actcaggttc 
9241 ttagaaagaa tactctctta aggagactgt 
9301 gaacatccag tgaatatcac gtagactgct 
9361 ctctggaaat aactgcccac tgttacgtgc 
9421 cccctatcct gattccttga gtcacaagaa 
9481 cctcactgca cccacctggg agccaagtgg 
9541 ctcttcccac ccacctttca ctcctcactg 
9601 gaagtacata gatcagcatc aacgttaaat 
9661 taattcaaag tttttgcatt tgtatcacaa 
9721 acaagaaggg aagccttgca aagggagctg 
9781 gggtcatagg ctgccacgtg cccaagggct 
9841 ctacctgcca atggttctta cgattctagt 
9901 aggaataaac atcctcttag atccatgagt 
9961 ccctgttgca ctaacacttg gggttggaga 
10021 tcctgagcca cggcagcagc aggggcaggg 
10081 ttgtggctgc cccctgggtg gtttctagat 
10141 aattgtggta gagtgatact cccagggtgg 
10201 tctgcagcat ttggaatagc cctcatcttt 
10261 aaggcaaacc tgattccgca aagcatcaaa 
10321 atcacaaacc acatcgattc tcagctgtgt 
10381 aactggagat ggcggcttgg ggtcaccgtg 
10441 ggcaaaactg tactcctaat tttcagtggc 
10501 tgaacatcac ctaagaataa tgccaggtgt 
10561 actaagcgag acgtatgacg gcagtcaggg 



agtcccagct actcgggagg ctgaggcagg 
gcagtgagcc cagatcgcgc cactgcactc 
caaaaaaaaa aaaaaaaaaa aagcaggcaa 
gcataataaa agattagggt ggggcagcca 
ctgtccaacc gatctttggg ccctatgtaa 
catttccaca cggagccgga agaaccactt 
gcttttctct tttctgttgc ctattaaacc 
gcatctttga tttccttggc gtgaggcaac 
ctgcttcaaa agtttcagaa catctcccca 
agaaacactg taaccttcag aaatatagct 
gagctgtgat ttctaatatg gggcatgggc 
tgtgccctgg agctgactcc tccaagcctg 
cggactgggg tgggaaccaa ggggaaagac 
catcttaaat ctccaggcgg gaaaatacgt 
tctactgctc tactttgctg taattctggt 
ttccctcccc ggtctgcccc tggtggacca 
gtccatggag gacagaggtc ggcccagcag 
tacaccacga gcccaaaggg tattcagcag 
gaggtggtgc ccctggctgc gcaccctgct 
ttaaacacac accacaaaga agaatcgaca 
gcaaaaactc ccccactgcg agttcatttt 
tgttttaaac tctaccctcc tcacccttct 
caggctggac caaagaacac tttcagtaaa 
gccctcccca gggtccagtg ctgccagtgc 
ttaggcactg ggagaattaa gccaagactt 
agcacttccc tcttgctgac acttagctct 
agatgctccg gggcccaggc caggctcctc 
aagcatatac ccacagtttg cctgctgcct 
acagaatgag attatttccc tgtcactcac 
ggaggggagg tctcggaatc ccaggtggca 
cagaggctct ggcaatgtca gctgtatctg 
tccatggcca ggatttgtga tgtcctcctg 
gggctctgtg tgcacagttc agtaccaggg 
ctatgcacgc acaagcaggc tgctgagcct 
aatgcatttt ccttccttgc tgagctgccc 
actttaaaag cataacagca cctgggatcc 
tggactgagg ccaccacttc aaggatgagg 
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